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Diagnostics a nd Therapeutics for Diseases Associated with Dipeptidvlpeptidase IV 
Q)PP4) 

5 Technical field of the invention 

The present invention is in the field of molecular biology, more particularly, the present 
invention relates to nucleic acid sequences and amino acid sequences of a human DPP4 
and its regulation for the treatment of cardiovascular disorders, dermatological disorders, 
10 cancer disorders, hematological disorders, respiratory diseases, gastrointestinal and liver 
diseases, urological disorders and metabolic diseases in mammals. 

Background of the invention 

15 Proteases play a role in carefully controlled processes, such as blood coagulation, 
fibrinolysis, complement activation, fertilization, and hormone production. These 
enzymes are also used in a variety of diagnostic, therapeutic, and industrial contexts. 
DPP4 is a member of the group of protease enzymes Abbott et al. (1994), Marguet et al. 
(2000), Misumi et al. (1992), US 5587299,US 5965373, WO200179473, US 6337069, 

20 WO9316102. 

Proteases were recognized very early in the history of biochemistry. In the nineteenth 
century, one primary focus of research was on digestive proteases, like pepsin and 
trypsin. Proteases belong systematically to the C-N Hydrolases. More specifically, 
25 proteases catalyze the hyprolytic cleavage of a peptide bond and are therefore called 
peptidases as well. 

Proteases can be classified according to several criteria, e.g. by localisation. Digestive 
proteases are located in the gastro-intestinal tract. These proteases are responsible for the 
30 digestion of food proteins. 
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Peptidases located extracellularly in the blood or other extracellular compartments of 
the body play often regulatory roles in processes like for example blood clotting, 
fibrinolysis, or activation of complement constituents. 

5 Intracellular^ located proteases exhibit a wide variety of roles. They are found in 
compartiments like the ER, the Golgi apparatus, or the lysomes. Their functions include 
for example activation of peptide hormons, ubiquitin mediated proteolysis, among 
others. 

10 Proteases are most commonly classified according to their mechanism of action, or to 
specific active groups that are present in the active center. The following groups can be 
distinguished: 

1. Serin-peptidases, 2. cystein-peptidases, 3. aspartyl- or acidic-peptidases, 4. metallo- 
1 5 peptidases, or 5. peptidases with yet unclear reaction mechanism. 

Serine peptidases 

Serine proteases exhibit a serine in the catalytic site which forms a covalent ester 
20 intermediate during the catalytic reaction pathway by a nucleophilic attack on the 
carboxy carbon of the peptide bond. In the active site of serine proteases a catalytic triad 
comprised of an aspartate, a histidine and the above mentioned serine is found. This 
triad functions in the reaction mechanism as a charge relay system. 

25 To the large family of serine protease belong, for example, the digestive enzymes trypsin 
and chymotrypsin, components of the complement cascade, enzymes involved in the 
blood clotting cascade, as well as enzymes that function in degradation, rebuilding and 
maintenace of constituents of the extracellular matrix. 

30 One feature of the serine protease family is the broad range of substrate specificity. 
Members of the trypase subgroup cleave after arginine or lysine, chymases after 
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phenylalanine or leucine, aspases after aspartate, metases after methionine and serases 
after serine. 

Cysteine proteases 

5 

During the catalytic reaction of cysteine proteases a covalent thioester intermediate is 
formed by a nucleophilic attack of the cysteine on the caboxy carbon of the peptide 
bond. Similar to the serine peptidases a catalytic triad comprised of the cysteine, a 
histidine and an asparagine is found which functions as a charge relay system to 
10 facilitate the formation of the thioester intermediate. 

Members of the Cysteine protease family have roles in many different cellular processes, 
e.g. processing of precursers or intracellular degradation. Examples for cysteine 
proteases include lysosomal cathepsines, and cytosolic calpains. 

15 

Aspartyl- or acidic peptidase 

The catalytic site of aspartyl proteases is composed of two aspartate residues. At the pH 
optimum of aspartyl proteases (2-3) one of the aspartyl carboxy groups is ionized and 
20 the other is neutral, which is important for the catalytic reaction to occur. Examples for 
aspartyl proteases are gastric pepsins A and C, chymosin, as well as mammalian renin. 

Metallo-peptidases 

25 Metallo-peptidases are proteases, whose proteolytic activity depends on the presence of 
divalent cations in the active center. Examples of members of this class are 
carboxypeptidase A, which represents a pancreatic digestive enzyme, the Angiotensin 
Converting Enzymes (ACE), which are responsible for the conversion of angiotensin I to 
angiotensin II, or the Extracellular Matrix Metalloprotienases. 

30 

In summary, a huge number of proteases play a central role in several important cellular 
and intracellular processes. Furthermore, the value as pharmaceutical targets has been 
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proven for several proteases. For example, the protease encoded by the HIV genome is 
used as a target for drugs for the treatment of HIV infections, the proteasom complex has 
been discovered as a n a nti-cancer t arget, o r C ys-proteases h ave b een i mplemented a s 
drug targets for inflammatory disorders. Selective inhibitors have been developed as 

5 therapeutic agents for diseases such as HTV. Thus, the identification of further disease 
implications of protease species and their splice variants may lead to the development of 
specific inhibitors or modulators, or suggest new utilities for known compounds 
affecting proteases. That in turn will provide additional pharmacological approaches to 
treat diseases and conditions in which protease activities are involved. This diseases may 

10 include, but are not limited to, infections such as bacterial, fungal, protozoan, and viral 
infections, particularly those caused by HTV viruses, cancers, allergies including asthma, 
cardiovascular diseases including acute heart failure, hypotension, hypertension, angina 
pectoris, myocardial infarction, hematological diseases, genito-urinary diseases 
including urinary incontinence and benign prostate hyperplasia, osteoporosis, and 

15 peripheral and central nervous system disorders including pain, Alzheimer's disease and 
Parkinson's disease. 

TaqMan-Technology / expression profiling 

20 TaqMan is a recently developed technique, in which the release of a fluorescent reporter 
dye from a hybridisation probe in real-time during a polymerase chain reaction (PCR) is 
proportional to the accumulation o f the PCR product. Quantification is based on the 
early, linear part of the reaction, and by determining the threshold cycle (CT), at which 
fluorescence above background is first detected. 

25 

Gene expression technologies may be useful in several areas of drug discovery and 
development, such as target identification, lead optimization, and identification of 
mechanisms of action. The TaqMan technology can be used to compare differences 
between expression profiles of normal tissue and diseased tissue. Expression profiling 
.30 has been used in identifying genes, which are up- or downregulated in a variety of 
diseases. An interesting application of expression profiling is temporal monitoring of 
changes in gene expression during disease progression and drug treatment or in patients 
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versus healthy individuals. The premise in this approach is that changes in pattern of 
gene expression in response to physiological or environmental stimuli (e.g., drugs) may 
serve as indirect clues about disease-causing genes or drug targets. Moreover, the 
effects of drugs with established efficacy on global gene expression patterns may 
5 provide a guidepost, or a genetic signature, against which a new drug candidate can be 
compared. 

DPP4 

10 The nucleotide sequence of DPP4 is accessible in public databases by the accession 
number NM_001935 and is given in SEQ ID NO:l . The amino acid sequence of DPP4 is 
depicted in SEQ ID NO:2. 

Misumi et al. (1992) cloned a cDNA coding for the human dipeptidyl peptidase IV 
15 (DPPIV). 

Abbott et al. (1994) demonstrated that CD26 spans approximately 70 kb and contains 26 
exons, ranging in size from 45 bp to 1.4 kb. The nucleotides that encode the serine 
recognition site (G-W-S-Y-G) are split between 2 exons. This clearly distinguishes the 

20 genomic organization of the prolyl oligopeptidase family from that of the classic serine 
protease family. CD26 encodes 2 messages sized at about 4.2 and 2.8 kb. These are both 
expressed at high levels in the placenta and kidney and at moderate levels in the lung 
and liver. Only the 4.2 kb mRNA was expressed at low levels in skeletal muscle, heart, 
brain, and pancreas. By fluorescence in situ hybridization, Abbott et al. (1994) mapped 

25 the gene to 2q24.3. 

Marguet et al. (2000) presented data indicating a critical role for CD26 in physiologic 
glucose homeostasis, and established it as a potential target for therapy in type II 
diabetes. Dipeptidyl peptidase IV enzyme activity has been implicated in the regulation 
30 of the biologic activity of multiple hormones and chemokines, including the 
insulinotropic peptides glucagon-like peptide- 1 (GLP1) and glucose-dependent 
insulinotropic polypeptide (GIP; 137240). Marguet et al. (2000) found that targeted 
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inactivation of the CD26 gene yielded healthy mice that had normal blood glucose levels 
in the fasted state, but reduced glycemic excursion after a glucose challenge. Levels of 
glucose-stimulated circulating insulin and the intact insulinotropic form of GLP1 were 
increased in CD26 -A mice. A pharmacologic inhibitor of DPP IV enzymatic activity 
5 improved glucose tolerance in wildtype, but not in CD26 -A, mice. This inhibitor also 
improved glucose tolerance in GLP1 receptor (GLP1R) -A mice, indicating that CD26 
contributes to blood glucose regulation by controlling the activity of GLP1 as well as 
additional substrates 

10 DPP4 is published in US 5587299, US 5965373 and US 6337069, WO9316102, 
WO200179473. DPP4 shows the highest homology (51%) to the human fibroblast 
activation protein, alpha as shown in example 1. 

Summary of the invention 

15 

The invention relates to novel disease associations of DPP4 polypeptides and 
polynucleotides. The invention also relates to novel methods of screening for 
therapeutic agents for the treatment of cardiovascular disorders, dermatological 
disorders, cancer disorders, hematological disorders, respiratory diseases, 

20 gastrointestinal and liver diseases, urological disorders and metabolic diseases in a 
mammal. The invention also relates to pharmaceutical compositions for the treatment of 
cardiovascular disorders, dermatological disorders, cancer disorders, hematological 
disorders, respiratory diseases, gastrointestinal and liver diseases, urological disorders 
and metabolic diseases in a mammal comprising a DPP4 polypeptide, a DPP4 

25 polynucleotide, or regulators of DPP4 or modulators of DPP4 activity. The invention 
further comprises methods of diagnosing cardiovascular disorders, dermatological 
disorders, cancer disorders, hematological disorders, respiratory diseases, 
gastrointestinal and liver diseases, urological disorders and metabolic diseases in a 
mammal. 



30 
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Brief Description of the Drawings 

Fig. 1 shows the nucleotide sequence of a DPP4 polynucleotide (SEQ ID NO: 1). 
fig. 2 shows the amino acid sequence of a DPP4 polypeptide (SEQ ID NO:2). 
5 Fig. 3 shows the nucleotide sequence of a primer useful for the invention (SEQ ID 
NO:3). 

Fig. 4 shows the nucleotide sequence of a primer useful for the invention (SEQ ID 
NO:4). 

Fig. 5 shows a nucleotide sequence useful as a probe to detect proteins of the invention 
10 (SEQIDNO:5). 

Detailed description of the invention 

Definition of terms 

15 

An "oligonucleotide" is a stretch of nucleotide residues which has a sufficient number of 
bases to be used as an oligomer, amplimer or probe in a polymerase chain reaction 
(PCR). Oligonucleotides are prepared from genomic or cDNA sequence and are used to 
amplify, reveal, or confirm the presence of a similar DNA or RNA in a particular cell or 
20 tissue. Oligonucleotides or oligomers comprise portions of a DNA sequence having at 
least about 10 nucleotides and as many as about 35 nucleotides, preferably about 25 
nucleotides. 

"Probes" may be derived from naturally occurring or recombinant single- or double- 
25 stranded nucleic acids or may be chemically synthesized. They are useful in detecting 
the presence of identical or similar sequences. Such probes may be labeled with reporter 
molecules using nick translation, Klenow fill-in reaction, PCR or other methods well 
known in the art. Nucleic acid probes may be used in southern, northern or in situ 
hybridizations to determine whether DNA or RNA encoding a certain protein is present 
30 in a cell type, tissue, or organ. 
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A "fragment of a polynucleotide" is a nucleic acid that comprises all or any part of a 
given nucleotide molecule, the fragment having fewer nucleotides than about 6 kb, 
preferably fewer than about 1 kb. 

5 "Reporter molecules" are radionuclides, enzymes, fluorescent, chemiluminescent, or 
chromogenic agents which associate with a particular nucleotide or amino acid 
sequence, thereby establishing the presence of a certain sequence, or allowing for the 
quantification of a certain sequence. 

10 "Chimeric" molecules may be constructed by introducing all or part of the nucleotide 
sequence of this invention into a vector containing additional nucleic acid sequence 
which might be expected to change any one or several of the following DPP4 
characteristics: cellular location, distribution, ligand-binding affinities, interchain 
affinities, degradation/turnover rate, signaling, etc. 

15 

"Active", with respect to a DPP4 polypeptide, refers to those forms, fragments, or 
domains of a DPP4 polypeptide which retain the biological and/or antigenic activity of a 
DPP4 polypeptide. 

20 'Naturally occurring DPP4 polypeptide" refers to a polypeptide produced by cells which 
have not been genetically engineered and specifically contemplates various polypeptides 
arising from post-translational modifications of the polypeptide including but not limited 
to acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

25 "Derivative" refers to polypeptides which have been chemically modified by techniques 
such as ubiquitination, labeling (see above), pegylation (derivatization with polyethylene 
glycol), and chemical insertion or substitution of amino acids such as ornithine which do 
not normally occur in human proteins. 

30 "Conservative amino acid substitutions" result from replacing one amino acid with 
another having similar structural and/or chemical properties, such as the replacement of 
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a leucine with an isoleucine or valine, an aspartate with a glutamate, or a threonine with 
a serine. 

"Insertions" or "deletions" are typically in the range of about 1 to 5 amino acids. The 
5 variation allowed may be experimentally determined by producing the peptide 
synthetically while systematically making insertions, deletions, or substitutions of 
nucleotides in the sequence using recombinant DNA techniques. 

A "signal sequence" or "leader sequence" can be used, when desired, to direct the 
1 0 polypeptide through a membrane of a cell. Such a sequence may be naturally present on 
the polypeptides of the present invention or provided from heterologous sources by 
recombinant DNA techniques. 

An "oligopeptide" is a short stretch of amino acid residues and may be expressed from 
15 an oligonucleotide. Oligopeptides comprise a stretch of amino acid residues of at least 
3, 5, 10 amino acids and at most 10, 15, 25 amino acids, typically of at least 9 to 13 
amino acids, and of sufficient length to display biological and/or antigenic activity. 

"Inhibitor** is any substance which retards or prevents a chemical or physiological 
20 reaction or response. Common inhibitors include but are not limited to antisense 
molecules, antibodies, and antagonists. 

"Standard expression" is a quantitative or qualitative measurement for comparison. It is 
based on a statistically appropriate number of normal samples and is created to use as a 
25 basis of comparison when performing diagnostic assays, running clinical trials, or 
following patient treatment profiles. 

"Animal" as used herein may be defined to include human, domestic (e.g., cats, dogs, 
etc.), agricultural (e.g., cows, horses, sheep, etc.) or test species (e.g., mouse, rat, rabbit, 
30 etc.). 
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A "DPP4 polynucleotide", within the meaning of the invention, shall be understood as 
being a nucleic acid molecule selected from a group consisting of 

(i) nucleic acid molecules encoding a polypeptide comprising the amino acid 
5 sequence of SEQ ID NO: 2, 

(ii) nucleic acid molecules comprising the sequence of SEQ ID NO: 1, 

(iii) nucleic acid molecules having the sequence of SEQ ID NO: 1 , 

10 

(iv) nucleic acid molecules the complementary strand of which hybridizes under 
stringent conditions to a nucleic acid molecule of (i), (ii), or (iii); and 

(v) nucleic acid molecules the sequence of which differs from the sequence of a 
1 5 nucleic acid molecule of (iii) due to the degeneracy of the genetic code; 

wherein the polypeptide encoded by said nucleic acid molecule has DPP4 activity. 

A "DPP4 polypeptide", within the meaning of the invention, shall be understood as 
20 being a polypeptide selected from a group consisting of 



(i) polypeptides having the sequence of SEQ ID NO: 2, 

(ii) polypeptides comprising the sequence of SEQ ED NO: 2, 

(iii) polypeptides encoded by DPP4 polynucleotides; and 



25 



30 



(iv) polypeptides which show at least 99%, 98%, 95%, 90%, or 80% homology with 
a polypeptide of (i), (ii), or (iii); 

wherein said polypeptide has DPP4 activity. 
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The nucleotide sequences encoding a DPP4 (or their complement) have numerous 
applications in techniques known to those skilled in the art of molecular biology. These 
techniques include use as hybridization probes, use in the construction of oligomers for 
PCR, use for chromosome and gene mapping, use in the recombinant production of 

5 DPP4, and use in generation of antisense DNA or RNA, their chemical analogs and the 
like. Uses of nucleotides encoding a DPP4 disclosed herein are exemplary of known 
techniques and are not intended to limit their use in any technique known to a person of 
ordinary skill in the art Furthermore, the nucleotide sequences disclosed herein may be 
used in molecular biology techniques that have not yet been developed, provided the 

10 new techniques rely on properties of nucleotide sequences that are currently known, e.g., 
the triplet genetic code, specific base pair interactions, etc. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 
genetic code, a multitude of DPP4 - encoding nucleotide sequences may be produced. 

15 Some of these will only bear minimal homology to the nucleotide sequence of the 
known and naturally occurring DPP4. The invention has specifically contemplated each 
and every possible variation of nucleotide sequence that could be made by selecting 
combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the nucleotide sequence 

20 of naturally occurring DPP4, and all such variations are to be considered as being 
specifically disclosed. 

Although the nucleotide sequences which encode a DPP4, its derivatives or its variants 
are preferably capable of hybridizing to the nucleotide sequence of the naturally 

25 occurring D PP4 p olynucleotide u nder s tringent c onditions, i t m ay be advantageous to 
produce nucleotide sequences encoding DPP4 polypeptides or its derivatives possessing 
a substantially different codon usage. Codons can be selected to increase the rate at 
which expression of the peptide occurs in a particular prokaryotic or eukaryotic 
expression host in accordance with the frequency with which particular codons are 

30 utilized by the host. Other reasons for substantially altering the nucleotide sequence 
encoding a DPP4 polypeptide and/or its derivatives without altering the encoded amino 
acid sequence include the production of RNA transcripts having more desirable 
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properties, such as a greater half-life, than transcripts produced from the naturally 
occurring sequence. 

Nucleotide sequences encoding a DPP4 polypeptide may be joined to a variety of other 
5 nucleotide sequences by means of well established recombinant DNA techniques. 
Useful nucleotide sequences for joining to DPP4 polynucleotides include an assortment 
of cloning vectors such as plasmids, cosmids, lambda phage derivatives, phagemids, and 
the like. Vectors of interest include expression vectors, replication vectors, probe 
generation vectors, sequencing vectors, etc. In general, vectors of interest may contain 
10 an origin of replication functional in at least one organism, convenient restriction endo- 
nuclease sensitive sites, and selectable markers for one or more host cell systems. 

Another aspect of the subject invention is to provide for DPP4-specific hybridization 
probes capable of hybridizing with naturally occurring nucleotide sequences encoding 

15 DPP4. Such probes may also be used for the detection of similar protease encoding 
sequences and should preferably show at least 40% nucleotide identity to DPP4 
polynucleotides. The hybridization probes of the subject invention maybe derived from 
the nucleotide sequence presented as SEQ ID NO: 1 or from genomic sequences 
including promoter, enhancers or introns of the native gene. Hybridization probes may 

20 be labelled by a variety of reporter molecules using techniques well known in the art. 

It will be recognized that many deletional or mutational analogs of DPP4 poly- 
nucleotides will be effective hybridization probes for DPP4 polynucleotides. 
Accordingly, the invention relates to nucleic acid sequences that hybridize with such 
25 DPP4 encoding nucleic acid sequences under stringent conditions. 

"Stringent conditions" refers to conditions that allow for the hybridization of 
substantially related nucleic acid sequences. For instance, such conditions will generally 
allow hybridization of sequence with at least about 85% sequence identity, preferably 
30 with at least about 90% sequence identity, more preferably with at least about 95% 
sequence identity. Hybridization conditions and probes can be adjusted in well- 
characterized ways to achieve selective hybridization of human-derived probes. 
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Stringent conditions, within the meaning of the invention are 65°C in a buffer containing 
1 mM EDTA, 0.5 M NaHP0 4 (pH 7.2), 7 % (w/v) SDS. 

Nucleic acid molecules that will hybridize to DPP4 polynucleotides under stringent 
5 conditions can be identified functionally. Without limitation, examples of the uses for 
hybridization probes include: histochemical uses such as identifying tissues that express 
DPP4; measuring mRNA levels, for instance to identify a sample's tissue type or to 
identify cells that express abnormal levels of DPP4; and detecting polymorphisms of 
DPP4. 

10 

PCR provides additional uses for oligonucleotides based upon the nucleotide sequence 
which encodes DPP4. Such probes used in PCR may be of recombinant origin, 
chemically synthesized, or a mixture of both. Oligomers may comprise discrete 
nucleotide sequences employed under optimized conditions for identification of DPP4 in 
15 specific tissues or diagnostic use. The same two oligomers, a nested set of oligomers, or 
even a degenerate pool of oligomers may be employed under less stringent conditions 
for identification of closely related DNAs or RNAs. 

Rules for designing polymerase chain reaction (PCR) primers are now established, as 
20 reviewed by PCR Protocols. Degenerate primers, i.e., preparations of primers that are 
heterogeneous at given sequence locations, can be designed to amplify nucleic acid 
sequences that are highly homologous to, but not identical with DPP4. Strategies are 
now available that allow for only one of the primers to be required to specifically 
hybridize with a known sequence. For example, appropriate nucleic acid primers can be 
25 ligated to the nucleic acid sought to be amplified to provide the hybridization partner for 
one of the primers. In this way, only one of the primers need be based on the sequence 
of the nucleic acid sought to be amplified. 

PCR methods for amplifying nucleic acid will utilize at least two primers. One of these 
30 primers will be capable of hybridizing to a first strand of the nucleic acid to be amplified 
and of priming enzyme-driven nucleic acid synthesis in a first direction. The other will 
be capable of hybridizing the reciprocal sequence of the first strand (if the sequence to 
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be amplified is single stranded, this sequence will initially be hypothetical, but will be 
synthesized in the first amplification cycle) and of priming nucleic acid synthesis from 
that strand in the direction opposite the first direction and towards the site of 
hybridization for the first primer. Conditions for conducting such amplifications, 
5 particularly under preferred stringent hybridization conditions, are well known. 

Other means of producing specific hybridization probes for DPP4 include the cloning of 
nucleic acid sequences encoding DPP4 or DPP4 derivatives into vectors for the 
production of mKNA probes. Such vectors are known in the art, are commercially 
10 available and may be used to synthesize RNA probes in vitro by means of the addition of 
the appropriate RNA polymerase as T7 or SP6 RNA polymerase and the appropriate 
reporter molecules. 

It is possible to produce a DNA sequence, or portions thereof, entirely by synthetic 
15 chemistry. A fter synthesis, the nucleic acid sequence can be inserted into any of the 
many available DNA vectors and their respective host cells using techniques which are 
well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into the nucleotide sequence. Alternately, a portion of sequence in which a 
mutation is desired can be synthesized and recombined with longer portion of an 
20 existing genomic or recombinant sequence. 

DPP4 polynucleotides may be used to produce a purified oligo-or polypeptide using well 
known methods of recombinant DNA technology. The oligopeptide may be expressed 
in a variety of host cells, either prokaryotic or eukaryotic. Host cells may be from the 
25 same species from which the nucleotide sequence was derived or from a different 
species. Advantages of producing an oligonucleotide by recombinant DNA technology 
include obtaining adequate amounts of the protein for purification and the availability of 
simplified purification procedures. 
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Ouantitative determinations of nucleic acids 

An important step in the molecular genetic analysis of human disease is often the 
enumeration of the copy number of a nucleis acid or the relative expression of a gene in 
5 particular tissues. 

Several different approaches are currently available to make quantitative determinations 
of nucleic acids; Chromosome-based techniques, such as comparative genomic 
hybridization (CGH) and fluorescent in situ hybridization (FISH) facilitate efforts to 

10 cytogenetically localize genomic regions that are altered in tumor cells. Regions of 
genomic alteration can be narrowed further using loss of heterozygosity analysis (LOH), 
in which disease DNA is analyzed and compared with normal DNA for the loss of a 
heterozygous polymorphic marker. The first experiments used restriction fragment 
length polymorphisms (RFLPs) [Johnson, (1989)], or hypervariable minisatellite DNA 

15 [Barnes, 2000]. In recent years LOH has been performed primarily using PCR 
amplification of microsatellite markers and electrophoresis of the radio labelled 
[Jeffreys, (1985)] or fluorescently labelled PCR products [Weber, (1990)] and compared 
between paired normal and disease DNAs. 

20 A number of other methods have also been developed to quantify nucleic acids [Gergen, 
(1992)]. More recently, PCR and RT-PCR methods have been developed which are 
capable of measuring the amount of a nucleic acid in a sample. One approach, for 
example, measures PCR product quantity in the log phase of the reaction before the 
formation of reaction products plateaus [Thomas, (1980)]. 

25 . 

A gene sequence contained in all samples at relatively constant quantity is typically 
utilized for sample amplification efficiency normalization. This approach, however, 
suffers from several drawbacks. The method requires that each sample has equal input 
amounts of the nucleic acid and that the amplification efficiency between samples is 
30 identical until the time of analysis. Furthermore, it is difficult using the conventional 
methods of PCR quantitation such as gel electrophoresis or plate capture hybridization 
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to determine that all samples are in fact analyzed during the log phase of the reaction as 
required by the method. 

Another method called quantitative competitive (QC)-PCR, as the name implies, relies 
5 on the inclusion of an internal control competitor in each reaction [Piatak, (1993), 
BioTechniques]. The efficiency of each reaction is normalized to the internal 
competitor. A known amount of internal competitor is typically added to each sample. 
The unknown target PCR product is compared with the known competitor PCR product 
to obtain relative quantitation. A difficulty with this general approach lies in developing 
1 0 an internal control that amplifies with the same efficiency than the target molecule. 

5' Fluorogenic Nuclease Assays 

Fluorogenic nuclease assays are a real time quantitation method that uses a probe to 
1 5 monitor formation of amplification product. The basis for this method of monitoring the 
formation of amplification product is to measure continuously PCR product 
accumulation using a dual-labelled fluorogenic oligonucleotide probe, an approach 
frequently referred to in the literature simply as the 4 TaqMan method" [Piatak,(1993), 
Science; Heid, (1996); Gibson, (1996); Holland. (1991)]. 

20 

The probe used in such assays is typically a short (about 20-25 bases) oligonucleotide 
that is labeled with two different fluorescent dyes. The 5' terminus of the probe is 
attached to a reporter dye and the 3* terminus is attached to a quenching dye, although 
the dyes could be attached at other locations on the probe as well. The probe is designed 

25 to have at least substantial sequence complementarity with the probe binding site. 
Upstream and downstream PCR primers which bind to flanking regions of the locus are 
added to the reaction mixture. When the probe is intact, energy transfer between the two 
fluorophors occurs and the quencher quenches emission from the reporter. During the 
extension phase of PCR, the probe is cleaved by the 5' nuclease activity of a nucleic acid 

30 polymerase such as Taq polymerase, thereby releasing the reporter from the 
oligonucleotide-quencher and resulting in an increase of reporter emission intensity 
which can be measured by an appropriate detector. 
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One detector which is specifically adapted for measuring fluorescence emissions such as 
those created during a fluorogenic assay is the ABI 7700 or 4700 HT manufactured by 
Applied Biosystems, Inc. in Foster City, Calif. The ABI 7700 uses fiber optics 
5 connected with each well in a 96-or 384 well PCR tube arrangement. The instrument 
includes a laser for exciting the labels and is capable of measuring the fluorescence 
spectra intensity from each tube with continuous monitoring during PCR amplification. 
Each tube is re-examined every 8.5 seconds. 

10 Computer software provided with the instrument is capable of recording the 
fluorescence intensity of reporter and quencher over the course of the amplification. 
The recorded values will then be used to calculate the increase in normalized reporter 
emission intensity on a continuous basis. The increase in emission intensity is plotted 
versus time, i.e., the number of amplification cycles, to produce a continuous measure of 

15 amplification. To quantify the locus in each amplification reaction, the amplification 
plot is examined at a point during the log phase of product accumulation. This is 
accomplished by assigning a fluorescence threshold intensity above background and 
determining the point at which each amplification plot crosses the threshold (defined as 
the threshold cycle number or Ct). Differences in threshold cycle number are used to 

20 quantify the relative amount of PCR target contained within each tube. Assuming that 
each reaction functions at 100% PCR efficiency, a difference of one Ct represents a two- 
fold difference in the amount of starting template. The fluorescence value can be used 
in conjunction with a standard curve to determine the amount of amplification product 
present. 

25 

Non-Probe~Based Detection Methods 

A variety of options are available for measuring the amplification products as they are 
formed. One method utilizes labels, such as dyes, which only bind to double stranded 
30 DNA. In this type of approach, amplification product (which is double stranded) binds 
dye molecules in solution to form a complex. With the appropriate dyes, it is possible to 
distinguish between dye molecules free in solution and dye molecules bound to 
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amplification product. For example, certain dyes fluoresce only when bound to 
amplification product. Examples of dyes whiGh can be used in methods of this general 
type include, but are not limited to, Syber Green.TM. and Pico Green from Molecular 
Probes, Inc. of Eugene, Oreg., ethidium bromide, propidium iodide, chromomycin, 
5 acridine orange, Hoechst 33258, Toto-1, Yoyo-1, DAPI (4\6-diamidino-2-phenylindole 
hydrochloride). 

Another real time detection technique measures alteration in energy fluorescence energy 
transfer between fluorophors conjugated with PCR primers [Livak, (1995)]. 

10 

Probe-Based Detection Methods 



These detection methods involve some alteration to the structure or conformation of a 
probe hybridized to the locus between the amplification primer pair. In some instances, 
15 the alteration is caused by the template-dependent extension catalyzed by a nucleic acid 
polymerase during the amplification process. The alteration generates a detectable 
signal which is an indirect measure of the amount of amplification product formed. 

For example, some methods involve the degradation or digestion of the probe during the 
20 extension reaction. These methods are a consequence of the 5'-3* nuclease activity 
associated with some nucleic acid polymerases. Polymerases having this activity cleave 
mononucleotides or small oligonucleotides from an oligonucleotide probe annealed to 
its complementary sequence located within the locus. 

25 The y end of the upstream primer provides the initial binding site for the nucleic acid 
polymerase. As the polymerase catalyzes extension of the upstream primer and 
encounters the bound probe, the nucleic acid polymerase displaces a portion of the 5' 
end of the probe and through its nuclease activity cleaves mononucleotides or 
oligonucleotides from the probe. 

30 

The upstream primer and the probe can be designed such that they anneal to the 
complementary strand in close proximity to one another. In fact, the 3' end of the 
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upstream primer and the 5 f end of the probe may abut one another. In this situation, 
extension of the upstream primer is not necessary in order for the nucleic acid 
polymerase to begin cleaving the probe. In the case in which intervening nucleotides 
separate the upstream primer and the probe, extension of the primer is necessary before 
5 the nucleic acid polymerase encounters the 5 1 end of the probe. Once contact occurs and 
polymerization continues, the 5'-y exonuclease activity of the nucleic acid polymerase 
begins cleaving mononucleotides or oligonucleotides from the 5' end of the probe. 
Digestion of the probe continues until the remaining portion of the probe dissociates 
from the complementary strand. 

10 

In solution, the two end sections can hybridize with each other to form a hairpin loop. In 
this conformation, the reporter and quencher dye are in sufficiently close proximity that 
fluorescence from the reporter dye is effectively quenched by the quencher dye. 
Hybridized probe, in contrast, results in a linearized conformation in which the extent of 
15 quenching is decreased. Thus, by monitoring emission changes for the two dyes, it is 
possible to indirectly monitor the formation of amplification product. 

Probes 

20 The labeled probe is selected so that its sequence is substantially complementary to a 
segment of the test locus or a reference locus. As indicated above, the nucleic acid site 
to which the probe binds should be located between the primer binding sites for the 
upstream and downstream amplification primers. 

25 Primers 

The primers used in the amplification are selected so as to be capable of hybridizing to 
sequences at flanking regions of the locus being amplified. The primers are chosen to 
have at least substantial complementarity with the different strands of the nucleic acid 
30 being amplified. When a probe is utilized to detect the formation of amplification 
products, the primers are selected in such that they flank the probe, i.e. are located 
upstream and downstream of the probe. 
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The primer must have sufficient length so that it is capable of priming the synthesis of 
extension products in the presence of an agent for polymerization. The length and 
composition of the primer depends on many parameters, including, for example, the 
5 temperature at which the annealing reaction is conducted, proximity of the probe 
binding site to that of the primer, relative concentrations of the primer and probe and the 
particular nucleic acid composition of the probe. Typically the primer includes 15-30 
nucleotides. However, the length of the primer may be more or less depending on the 
complexity of the primer binding site and the factors listed above. 

10 

Labels for Probes and Primers 

The labels used for labeling the probes or primers of the current invention and which can 
provide the signal corresponding to the quantity of amplification product can take a 

15 variety of forms. As indicated above with regard to the 5' fluorogenic nuclease method, 
a fluorescent signal is one signal which can be measured. However, measurements may 
also be made, for example, by monitoring radioactivity, colorimetry, absorption, 
magnetic parameters, or enzymatic activity. Thus, labels which can be employed 
include, but are not limited to, fluorophors, chromophores, radioactive isotopes, electron 

20 dense reagents, enzymes, and ligands having specific binding partners (e.g., biotin- 
avidin). 

Monitoring changes in fluorescence is a particularly useful way to monitor the 
accumulation of amplification products. A number of labels useful for attachment to 
25 probes or primers are commercially available including fluorescein and various 
fluorescein derivatives such as FAM, HEX, TET and JOE (all which are available from 
Applied Biosystems, Foster City, Calif); lucifer yellow, and coumarin derivatives. 

Labels may be attached to the probe or primer using a variety of techniques and can be 
30 attached at the 5' end, and/or the 3 1 end and/or at an internal nucleotide. The label can 
also be attached to spacer arms of various sizes which are attached to the probe or 
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primer. These spacer arms are useful for obtaining a desired distance between multiple 
labels attached to the probe or primer. 

In some instances, a single label may be utilized; whereas, in other instances, such as 
5 with the 5* fluorogenic nuclease assays for example, two or more labels are attached to 
the probe. In cases wherein the probe includes multiple labels, it is generally advisable 
to maintain spacing between the labels which is sufficient to permit separation of the 
labels during digestion of the probe through the 5-3 1 nuclease activity of the nucleic acid 
polymerase. 

10 

Patients Exhibiting Symptoms of Disease 

A number of diseases are associated with changes in the copy number of a certain gene. 
For patients having symptoms of a disease, the real-time PCR method can be used to 
15 determine if the patient has copy number alterations which are known to be linked with 
diseases that are associated with the symptoms the patient has. 

DPP4 expression 

20 DPP4 fusion proteins 

Fusion proteins are useful for generating antibodies against DPP4 polypeptides and for 
use in various assay systems. For example, fusion proteins can be used to identify 
proteins which interact with portions of DPP4 polypeptides. Protein affinity chromato- 
25 graphy or library-based assays for protein-protein interactions, such as the yeast two- 
hybrid or phage display systems, can be used for this purpose. Such methods are well 
known in the art and also can be used as drug screens. 

A DPP4 fusion protein comprises two polypeptide segments fused together by means of 
30 a peptide bond. The first polypeptide segment can comprise at least 54, 75, 100, 125, 
139, 150, 175, 200, 225, 250, 275, 300, 325 or 350 contiguous amino acids of SEQ ID 
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NO: 2 or of a biologically active variant, such as those described above. The first 
polypeptide segment also can comprise full-length DPP4. 

The second polypeptide segment can be a full-length protein or a protein fragment. 
5 Proteins commonly used in fusion protein construction include, but are not limited to p 
galactosidase, p-glucuronidase, green fluorescent protein (GFP), autofluorescent 
proteins, including blue fluorescent protein (BFP), glutathione-S-transferase (GST), 
luciferase, horseradish peroxidase (HRP), and chloramphenicol acetyltransferase (CAT). 
Additionally, epitope tags are used in fusion protein constructions, including histidine 
10 (His) tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV-G tags, and 
thioredoxin (Trx) tags. Other fusion constructions can include maltose binding protein 
(MBP), S-tag, Lex a DNA binding domain (DBD) fusions, GAL4 DNA binding domain 
fusions, and herpes simplex virus (HSV) BP16 protein fusions. A fusion protein also 
can be engineered to contain a cleavage site located adjacent to the DPP4. 

15 

Preparation of Polynucleotides 

A naturally occurring DPP4 polynucleotide can be isolated free of other cellular 
components such as membrane components, proteins, and lipids. Polynucleotides can 

20 be made by a cell and isolated using standard nucleic acid purification techniques, or 
synthesized using an amplification technique, such as the polymerase chain reaction 
(PCR), or by using an automatic synthesizer. Methods for isolating polynucleotides are 
routine and are known in the art. Any such technique for obtaining a polynucleotide can 
be used to obtain isolated DPP4 polynucleotides. For example, restriction enzymes and 

25 probes can be used to isolate polynucleotide fragments which comprise DPP4 nucleotide 
sequences. Isolated polynucleotides are in preparations which are free or at least 70, 80, 
or 90% free of other molecules. 

DPP4 cDNA molecules can be made with standard molecular biology techniques, using 
30 DPP4 mRNA as a template. DPP4 cDNA molecules can thereafter be replicated using 
molecular biology techniques known in the art. An amplification technique, such as 
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PCR, can be used to obtain additional copies of polynucleotides of the invention, using 
either human genomic DNA or cDNA as a template. 

Alternatively, synthetic chemistry techniques can be used to synthesizes DPP4 
5 polynucleotides. The degeneracy of the genetic code allows alternate nucleotide 
sequences to be synthesized which will encode DPP4 having, for example, an amino 
acid sequence shown in SEQ ID NO: 2 or a biologically active variant thereof. 

Extending Polynucleotides 

10 

Various PCR-based methods can be used to extend nucleic acid sequences encoding 
human DPP4, for example to detect upstream sequences of DPP4 gene such as 
promoters and regulatory elements. For example, restriction-site PCR uses universal 
primers to retrieve unknown sequence adjacent to a known locus. Genomic DNA is first 
15 amplified in the presence of a primer to a linker sequence and a primer specific to the 
known region. The amplified sequences are then subjected to a second round of PCR 
with the same linker primer and another specific primer internal to the first one. 
Products of each round of PCR are transcribed with an appropriate RNA polymerase and 
sequenced using reverse transcriptase. 

20 

Inverse PCR also can be used to amplify or extend sequences using divergent primers 
based on a known region. Primers can be designed using commercially available 
software, such as OLIGO 4.06 Primer Analysis software (National Biosciences Inc., 
Plymouth, Minn.), to be 22-30 nucleotides in length, to have a GC content of 50% or 
25 more, and to anneal to the target sequence at temperatures about 68-72°C. The method 
uses several restriction enzymes to generate a suitable fragment in the known region of a 
gene. The fragment is then circularized by intramolecular ligation and used as a PCR 
template. 

30 Another method which can be used is capture PCR, which involves PCR amplification 
of DNA fragments adjacent to a known sequence in human and yeast artificial 
chromosome DNA. In this method, multiple restriction enzyme digestions and ligations 
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also can be used to place an engineered double-stranded sequence into an unknown 
fragment of the DNA molecule before performing PCR. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
5 size-selected to include larger cDNAs. Randomly-primed libraries are preferable, in that 
they will contain more sequences which contain the 5* regions of genes. Use of a 
randomly primed library may be especially preferable for situations in which an oligo 
d(T) library does not yield a full-length cDNA. Genomic" libraries can be useful for 
extension of sequence into 5 1 non-transcribed regulatory regions. 

10 

Commercially available capillary electrophoresis systems can be used to analyze the size 
or confirm the nucleotide sequence of PCR or sequencing products. For example, 
capillary sequencing can employ flowable polymers for electrophoretic separation, four 
different fluorescent dyes (one for each nucleotide) which are laser activated, and 

15 detection of the emitted wavelengths by a charge coupled device camera. Output/light 
intensity can be converted to electrical signal using appropriate equipment and software 
(e.g., GENOTYPER and Sequence NAVIGATOR, Perlrin Elmer), and the entire process 
from loading of samples to computer analysis and electronic data display can be 
computer controlled. Capillary electrophoresis is especially preferable for the 

20 sequencing of small pieces of DNA which might be present in limited amounts in a 
particular sample. 

Obtaining Polypeptides 

25 DPP4 can be obtained, for example, by purification from human cells, by expression of 
DPP4 polynucleotides, or by direct chemical synthesis. 

Protein Purification 

30 DPP4 can be purified from any human cell which expresses the enzyme, including those 
which have been transfected with expression constructs which express DPP4. A purified 
DPP4 is separated from other compounds which normally associate with DPP4 in the 
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cell, such as certain proteins, carbohydrates, or lipids, using methods well-known in the 
art. Such methods include, but are not limited to, size exclusion chromatography, 
ammonium sulfate fractionation, ion exchange chromatography, affinity 
chromatography, and preparative gel electrophoresis. 

5 

Expression of DPP4 Polynucleotides 

To express DPP4, DPP4 polynucleotides can be inserted into an expression vector which 
contains the necessary elements for the transcription and translation of the inserted 
10 coding sequence. Methods which are well known to those skilled in the art can be used 
to construct expression vectors containing sequences encoding DPP4 and appropriate 
transcriptional and translational control elements. These methods include in vitro 
recombinant DNA techniques, synthetic techniques, and in vivo genetic recombination. 

15 A variety of expression vector/host systems can be utilized to contain and express 
sequences encoding DPP4. These include, but are not limited to, microorganisms, such 
as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA 
expression vectors; yeast transformed with yeast expression vectors, insect cell systems 
infected with virus expression vectors (e.g., baculovirus), plant cell systems transformed 

20 with virus expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic 
virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids), or 
animal cell systems. 

The control elements or regulatory sequences are those non-translated regions of the 
25 vector - enhancers, promoters, 5 1 and 3' untranslated regions — which interact with host 
cellular proteins to carry out transcription and translation. Such elements can vary in 
their strength and specificity. D epending on the vector system and host utilized, any 
number of suitable transcription and translation elements, including constitutive and 
inducible promoters, can be used. For example, when cloning in bacterial systems, 
30 inducible promoters such as the hybrid lacZ promoter of the BLUESCRIPT phagemid 
(Stratagene, LaJolla, Calif.) or pSPORTl plasmid (Life Technologies) and the like can 
be used. The baculovirus polyhedrin promoter can be used in insect cells. Promoters or 
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enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO, and 
storage protein genes) or from plant viruses (e.g., viral promoters or leader sequences) 
can be cloned into the vector. In mammalian cell systems, promoters from mammalian 
genes or from mammalian viruses are preferable. If it is necessary to generate a cell line 
5 that contains multiple copies of a nucleotide sequence encoding DPP4, vectors based on 
SV40 or EBV can be used with an appropriate selectable marker. 

Bacterial and Yeast Expression Systems 

10 In bacterial systems, a number of expression vectors can be selected. For example, 
when a large quantity of DPP4 is needed for the induction of antibodies, vectors which 
direct high level expression of fusion proteins that are readily purified can be used. 
Such vectors include, but are not limited to, multifunctional E. coli cloning and 
expression v ectors s uch a s B LUESCRIPT ( Stratagene). In a BLUESCRIPT vector, a 

15 sequence encoding DPP4 can be ligated into the vector in frame with sequences for the 
amino-terminal Met and the subsequent 7 residues of (J-galactosidase so that a hybrid 
protein is produced. pIN vectors or pGEX vectors (Promega, Madison, Wis.) also can 
be used to express foreign polypeptides as fusion proteins with glutathione S-transferase 
(GST). In general, such fusion proteins are soluble and can easily be purified from lysed 

20 cells by adsorption to glutathione-agarose beads followed by elution in the presence of 
free glutathione. Proteins made in such systems can be designed to include heparin, 
thrombin, or factor Xa protease cleavage sites so that the cloned polypeptide of interest 
can be released from the GST moiety at will. 

25 Plant and Insect Expression Systems 

If plant expression vectors are used, the expression of sequences encoding DPP4 can be 
driven by any of a number of promoters. For example, viral promoters such as the 35S 
and 19S promoters of CaMV can be used alone or in combination with the omega leader 
30 sequence from TMV. Alternatively, plant promoters such as the small subunit of 
RUBISCO or heat shock promoters can be used. These constructs can be introduced 
into plant cells by direct DNA transformation or by pathogen-mediated transfection. 
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An insect system also can be used to express DPP4. For example, in one such system 
Autographa califomica nuclear polyhedrosis virus (AcNPV) is used as a vector to 
express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larvae, 
5 Sequences encoding DPP4 can be cloned into a non-essential region of the virus, such as 
the polyhedrin gene, and placed under control of the polyhedrin promoter. Successful 
insertion of DPP4 will render the polyhedrin gene inactive and produce recombinant 
virus lacking coat protein. The recombinant viruses can then be used to infect S. 
frugiperda cells or Trichoplusia larvae in which DPP4 can be expressed. 

10 

Mammalian Expression Systems 

A number of viral-based expression systems can be used to express DPP4 in mammalian 
host cells. For e xample, i f a n a denovirus i s u sed a s a n e xpression v ector, s equences 

15 encoding DPP4 can be ligated into an adenovirus transcription/translation complex 
comprising the late promoter and tripartite leader sequence. Insertion in a non-essential 
El or E3 region of the viral genome can be used to obtain a viable virus which is 
capable of expressing DPP4 in infected host cells [Engelhard, 1994)]. If desired, 
transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, can be used to 

20 increase expression in mammalian host cells. 

Human artificial chromosomes (HACs) also can be used to deliver larger fragments of 
DNA than can be contained and expressed in a plasmid. HACs of 6M to 10M are 
constructed and delivered to cells via conventional delivery methods (e.g., liposomes, 

25 polycationic amino polymers, or vesicles). Specific initiation signals also can be used to 
achieve more efficient translation of sequences encoding DPP4. Such signals include 
the ATG initiation codon and adjacent sequences. In cases where sequences encoding 
DPP4, its initiation codon, and upstream sequences are inserted into the appropriate 
expression vector, no additional transcriptional or translational control signals may be 

30 needed. However, in cases where only coding sequence, or a fragment thereof, is 
inserted, exogenous translational control signals (including the ATG initiation codon) 
should be provided. The initiation codon should be in the correct reading frame to 
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ensure translation of the entire insert. Exogenous translational elements and initiation 
codons can be of various origins, both natural and synthetic. 

Host Cells 

5 

A host cell strain can be chosen for its ability to modulate the expression of the inserted 
sequences or to process the expressed DPP4 in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing 

10 which cleaves a "prepro" form of the polypeptide also can be used to facilitate correct 
insertion, folding and/or function. Different host cells which have specific cellular 
machinery and characteristic mechanisms for post-translational activities (e.g., CHO, 
HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture 
Collection (ATCC; 10801 University Boulevard, Manassas, VA 201 10-2209) and can be 

15 chosen to ensure the correct modification and processing of the foreign protein. 

Stable expression is preferred for long-term, high-yield production of recombinant 
proteins. For example, cell lines which stably express DPP4 can be transformed using 
expression vectors which can contain viral origins of replication and/or endogenous 

20 expression elements and a selectable marker gene on the same or on a separate vector. 
Following the introduction of the vector, cells can be allowed to grow for 1-2 days in an 
enriched medium before they are switched to a selective medium. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth and 
recovery of cells which successfully express the introduced DPP4 sequences. Resistant 

25 clones of stably transformed cells can be proliferated using tissue culture techniques 
appropriate to the cell type. Any number of selection systems can be used to recover 
transformed cell lines. These include, but are not limited to, the herpes simplex virus 
thymidine kinase [Logan, (1984)] and adenine phosphoribosyltransferase [Wigler, 
(1977)] genes which can be employed in tk" or aprf cells, respectively. Also, 

30 antimetabolite, antibiotic, or herbicide resistance can be used as the basis for selection. 
For example, dhfr confers resistance to methotrexate [Lowy, (1980)], npt confers 
resistance to the aminoglycosides, neomycin and G-418 [Wigler, (1980)], and als and 
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pat confer resistance to chlorsulfiiron and phosphinotricin acetyltransferase, respectively 
[Colbere-Garapin, 1981]. Additional selectable genes have been described. For 
example, trpB allows cells to utilize indole in place of tryptophan, or hisD, which allows 
cells to utilize histinol in place of histidine. Visible markers such as anthocyanins, |$- 
5 glucuronidase and its substrate GUS, and luciferase and its substrate luciferin, can be 
used to identify transformants and to quantify the amount of transient or stable protein 
expression attributable to a specific vector system 

Detecting Polypeptide Expression 

10 

Although the presence of marker gene expression suggests that a DPP4 polynucleotide is 
also present, its presence and expression may need to be confirmed. For example, if a 
sequence encoding DPP4 is inserted within a marker gene sequence, transformed cells 
containing sequences which encode DPP4 can be identified by the absence of marker 
15 gene function. Alternatively, a marker gene can be placed in tandem with a sequence 
encoding DPP4 under the control of a single promoter. Expression of the marker gene 
in response to induction or selection usually indicates expression of DPP4 
polynucleotide. 

20 Alternatively, host cells which contain a DPP4 polynucleotide and which express DPP4 
can be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip- 
based technologies for the detection and/or quantification of nucleic acid or protein. For 

25 example, the presence of a polynucleotide sequence encoding DPP4 can be detected by 
DNA-DNA or DNA-RNA hybridization or amplification using probes or fragments or 
fragments of polynucleotides encoding DPP4. Nucleic acid amplification-based assays 
involve the use of oligonucleotides selected from sequences encoding DPP4 to detect 
transformants which contain a DPP4 polynucleotide. 

30 

A variety of protocols for detecting and measuring the expression of DPP4, using either 
polyclonal or monoclonal antibodies specific for the polypeptide, are known in the art. 
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Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA), and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based 
immunoassay using monoclonal antibodies reactive to two non-interfering epitopes on 
DPP4 can be used, or a competitive binding assay can be employed. 

5 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and can be used in various nucleic acid and amino acid assays. Means for producing 
labeled hybridization or PCR probes for detecting sequences related to polynucleotides 
encoding DPP4 include oligolabeling, nick translation, end-labeling, or PCR 

10 amplification using a labeled nucleotide. Alternatively, sequences encoding DPP4 can 
be cloned into a vector for the production of an mRNA probe. Such vectors are known 
in the art, are commercially available, and can be used to synthesize RN A probes in 
vitro by addition of labeled nucleotides and an appropriate RNA polymerase such as T7, 
T3, or SP6. These procedures can be conducted using a variety of commercially 

15 available kits (Amersham Pharmacia Biotech, Promega, and US Biochemical). Suitable 
reporter molecules or labels which can be used for ease of detection include 
radionuclides, enzymes, and fluorescent, chemiluminescent, or chromogenic agents, as 
well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

20 Expression and Purification of Polypeptides 

Host cells transformed with DPP4 polynucleotides can be cultured under conditions 
suitable for the expression and recovery of the protein from cell culture. The 
polypeptide produced by a transformed cell can be secreted or contained intracellularly 
25 depending on the sequence and/or the vector used. As will be understood by those of 
skill in the art, expression vectors containing DPP4 polynucleotides can be designed to 
contain signal sequences which direct secretion of soluble DPP4 through a prokaryotic 
or eukaryotic cell membrane or which direct the membrane insertion of membrane-, 
bound DPP4. 

30 

As discussed above, other constructions can be used to join a sequence encoding DPP4 
to a nucleotide sequence encoding a polypeptide domain which will facilitate 
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purification of soluble proteins. Such purification facilitating domains include, but are 
not limited to, metal chelating peptides such as histidine-tiyptophan modules that allow 
purification on immobilized metals, protein A domains that allow purification on 
immobilized immunoglobulin, and the domain utilized in the FLAGS extension/affinity 
5 purification system (Immunex Corp., Seattle, Wash.). Inclusion of cleavable linker 
sequences such as those specific for Factor XA or enterokinase (Invitrogen, San Diego, 
CA) between the purification domain and DPP4 also can be used to facilitate 
purification. One such expression vector provides for expression of a fusion protein 
containing DPP4 and 6 histidine residues preceding a thioredoxin or an enterokinase 
10 cleavage site. The histidine residues facilitate purification by IMAC (immobilized metal 
ion affinity chromatography) Maddox, (1983)], while the enterokinase cleavage site 
provides a means for purifying DPP4 from the fusion protein [Porath, (1992)]. 

Chemical Synthesis 

15 

Sequences encoding DPP4 can be synthesized, in whole or in part, using chemical 
methods well known in the art. Alternatively, DPP4 itself can be produced using 
chemical methods to synthesize its amino acid sequence, such as by direct peptide 
synthesis using solid-phase techniques. Protein synthesis can either be performed using 
20 manual techniques or by automation. Automated synthesis can be achieved, for 
example, using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer). 
Optionally, fragments of DPP4 can be separately synthesized and combined using 
chemical methods to produce a full-length molecule. 

25 The newly synthesized peptide can be substantially purified by preparative high 
performance liquid chromatography. The composition of a synthetic DPP4 can be 
confirmed by amino acid analysis or sequencing. Additionally, any portion of the amino 
acid sequence of DPP4 can be altered during direct synthesis and/or combined using 
chemical methods with sequences from other proteins to produce a variant polypeptide 

30 or a fusion protein. 
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Production of Altered Polypeptides 

As will be understood by those of skill in the art, it may be advantageous to produce 
DPP4 polynucleotides possessing non-naturally occurring codons. For example, codons 
5 preferred by a particular prokaryotic or eukaryotic host can be selected to increase the 
rate of protein expression or to produce an RNA transcript having desirable properties, 
such as a half-life which is longer than that of a transcript generated from the naturally 
occurring sequence. 

10 The nucleotide sequences referred to herein can be engineered using methods generally 
known in the art to alter DPP4 polynucleotides for a variety of reasons, including but not 
limited to, alterations which modify the cloning, processing, and/or expression of the 
polypeptide or mRNA product. DNA shuffling by random fragmentation and PCR re- 
assembly of gene fragments and synthetic oligonucleotides can be used to engineer the 

15 nucleotide sequences. For example, site-directed mutagenesis can be used to insert new 
restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, introduce mutations, and so forth. 

DPP4 Analogs 

20 

One general class of DPP4 analogs are variants having an amino acid sequence that is a 
mutation of the amino acid sequence disclosed herein. Another general class of DPP4 
analogs is provided by anti-idiotype antibodies, and fragments thereof, as described 
below. Moreover, recombinant antibodies comprising anti-idiotype variable domains can 
25 be used as analogs (see, for example, [Monferdini et al., ( 1996)]). Since the variable 
domains of anti-idiotype DPP4 antibodies mimic DPP4, these domains can provide 
DPP4 enzymatic activity. Methods of producing anti-idiotypic catalytic antibodies are 
known to those of skill in the art [Joron et al., (1992), Friboulet et al. (1994), Avalle et 
al, (1998)]. 

30 

Another approach to identifying DPP4 analogs is provided by the use of combinatorial 
libraries. Methods for constructing and screening phage display and other combinatorial 
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libraries are provided, for example, by [Kay et al., Phage Display of Peptides and 
Proteins (Academic Press 1996), U.S. 5,783,384, U.S. 5,747,334, and U.S. 5,723,323. 

One illustrative in vitro use of DPP4 and its analogs is the production of labeled peptides 
5 from a labeled protein substrate. Proteases can also be used in detergents and cleaning 
solutions. For example, serine proteases are used in solutions to clean and to disinfect 
contact lenses (see, for example, [U.S. 5,985,629]). Another use for a serine protease is 
in the formulation of vaccines (see, for example, [U.S. 5,885,814]). Those of skill in the 
art can devise other uses for molecules having DPP4 activity. 

10 

Antibodies 

Any type of antibody known in the art can be generated to bind specifically to an epitope 
ofDPP4. 

15 

"Antibody as used herein includes intact immunoglobulin molecules, as well as 
fragments thereof, such as Fab, F(ab% and Fv, which are capable of binding an epitope 
of DPP4. Typically, at least 6, 8, 10, or 12 contiguous amino acids are required to form 
an epitope. However, epitopes which involve non-contiguous amino acids may require 

20 more, eg., at least 15, 25, or 50 amino acid. An antibody which specifically binds to an 
epitope of DPP4 can be used therapeutically, as well as in immunochemical assays, such 
as Western blots, ELISAs, radioimmunoassays, immunohistochemical assays, 
immunoprecipitations, or other immunochemical assays known in the art. Various 
immunoassays can be used to identify antibodies having the desired specificity. 

25 Numerous protocols for competitive binding or immunoradiometric assays are well 
known in the art. Such immunoassays typically involve the measurement of complex 
formation between an immunogen and an antibody which specifically binds to the DPP4 
immunogen. 

30 Typically, an antibody which specifically binds to DPP4 provides a detection signal at 
least 5-, 10-, or 20-fold higher than a detection signal provided with other proteins when 
used in an immunochemical assay. Preferably, antibodies which specifically bind to 
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DPP4 do not detect other proteins in immunochemical assays and can immuno- 
precipitate DPP4 from solution. 

DPP4 can be used to immunize a mammal, such as a mouse, rat, rabbit, guinea pig, 
5 monkey, or human, to produce polyclonal antibodies. If desired, DPP4 can be 
conjugated to a carrier protein, such as bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin. Depending on the host species, various adjuvants can be 
used to increase the immunological response. Such adjuvants include, but are not 
limited to, Freund's adjuvant, mineral gels (e.g., aluminum hydroxide), and surface 
10 active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, keyhole limpet hemocyanin, and dinitrophenol). Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are especially 
useful. 

15 Monoclonal antibodies which specifically bind to DPP4 can be prepared using any 
technique which provides for the production of antibody molecules by continuous cell 
lines in culture. These techniques include, but are not limited to, the hybridoma 
technique, the human B-cell hybridoma technique, and the EBV-hybridoma technique 
[Roberge, (1995)]. 

20 

In addition, techniques developed for the production of "chimeric antibodies", the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with 
appropriate antigen specificity and biological activity, can be used. Monoclonal and 
other antibodies also can be "humanized" to prevent a patient from mounting an immune 

25 response against the antibody when it is used therapeutically. Such antibodies may be 
sufficiently similar in sequence to human antibodies to be used directly in therapy or 
may require alteration of a few key residues. Sequence differences between rodent 
antibodies and human sequences can be minimized by replacing residues which differ 
from those in the human sequences by site directed mutagenesis of individual residues or 

30 by grating of entire complementarity determining regions. Antibodies which 
specifically bind to DPP4 can contain antigen binding sites which are either partially or 
fully humanized, as disclosed in U.S. 5,565,332. 
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Altematively, techniques described for the production of single chain antibodies can be 
adapted using methods known in the art to produce single chain antibodies which 
specifically bind to DPP4. Antibodies with related specificity, but of distinct idiotypic 
5 composition, can be generated by chain shuffling from random combinatorial 
immunoglobin libraries. Single-chain antibodies also can be constructed using a DNA 
amplification method, such as PCR, using hybridoma cDNA as a template. Single-chain 
antibodies can be mono- or bispecific, and can be bivalent or tetravalent. Construction 
of tetravalent, bispecific single-chain antibodies is taught. A nucleotide sequence 
10 encoding a single-chain antibody can be constructed using manual or automated 
nucleotide synthesis, cloned into an expression construct using standard recombinant 
DNA methods, and introduced into a cell to express the coding sequence, as described 
below. Alternatively, single-chain antibodies can be produced directly using, for 
example, filamentous phage technology. 

15 

Antibodies which specifically bind to DPP4 also can be produced by inducing in vivo 
production in the lymphocyte population or by screening immunoglobulin libraries or 
panels of highly specific binding reagents. Other types of antibodies can be constructed 
and used therapeutically in methods of the invention. For example, chimeric antibodies 
20 can be constructed as disclosed in WO 93/03151. Binding proteins which are derived 
from immunoglobulins and which are multivalent and multispecific, such as the 
"diabodies" described in WO 94/13804, also can be prepared. 

Antibodies according to the invention can be purified by methods well known in the art 
25 For example, antibodies can be affinity purified by passage over a column to which 
DPP4 is bound. The bound antibodies can then be eluted from the column using a buffer 
with a high salt concentration. 

Antisense Oligonucleotides 

30 

Antisense oligonucleotides are nucleotide sequences which are complementary to a 
specific DNA or RNA sequence. Once introduced into a cell, the complementary 
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nucleotides combine with natural sequences produced by the cell to form complexes and 
block either transcription or translation. Preferably, an antisense oligonucleotide is at 
least 1 1 nucleotides in length, but can be at least 12, 15, 20, 25, 30, 35, 40, 45, or 50 or 
more nucleotides long. Longer sequences also can be used. Antisense oligonucleotide 
5 molecules can be provided in a DNA construct and introduced into a cell as described 
above to decrease the level of DPP4 gene products in the cell. 

Antisense oligonucleotides can be deoxyribonucleotides, ribonucleotides, or a combi- 
nation of both. Oligonucleotides can be synthesized manually or by an automated 
10 synthesizer, by covalently linking the 5' end of one nucleotide with the 3' end of another 
nucleotide with non-phosphodiester internucleotide linkages such alkylphosphonates, 
phosphorothioates, phosphorodithioates, alkylphosphonothioates, alkylphosphonates, 
phosphoramidates, phosphate esters, carbamates, acetamidate, carboxymethyl esters, 
carbonates, and phosphate triesters. 

15 

Modifications of DPP4 gene expression can be obtained by designing antisense 
oligonucleotides which will form duplexes to the control, 5', or regulatory regions of the 
DPP4 gene. Oligonucleotides derived from the transcription initiation site, e.g., between 
positions -10 and +10 from the start site, are preferred. Similarly, inhibition can be 

20 achieved using "triple helix" base-pairing methodology. Triple helix pairing is useful 
because it causes inhibition of the ability of the double helix to open sufficiently for the 
binding of polymerases, transcription factors, or chaperons. Therapeutic advances using 
triplex DNA have been described in the literature [Nicholls, (1993)]. An antisense 
oligonucleotide also can be designed to block translation of mKNA by preventing the 

25 transcript from binding to ribosomes. 

Precise complementarity is not required for successful complex formation between an 
antisense oligonucleotide and the complementary sequence of a DPP4 polynucleotide. 
Antisense oligonucleotides which comprise, for example, 2, 3, 4, or 5 or more stretches 
30 of contiguous nucleotides which are precisely complementary to a DPP4 polynucleotide, 
each separated by a stretch of contiguous nucleotides which are not complementary to 
adjacent DPP4 nucleotides, can provide sufficient targeting specificity for DPP4 mRNA. 



WO 2004/104216 



PCT/EP2004/005071 



-37- 

* 

Preferably, each stretch of complementary contiguous nucleotides is at least 4, 5, 6, 7, or 
8 or more nucleotides in length. Non-complementary intervening sequences are 
preferably 1, 2, 3, or 4 nucleotides in length. One skilled in the art can easily use the 
calculated melting point of an antisense-sense pair to determine the degree of 

5 mismatching which will be tolerated between a particular antisense oligonucleotide and 
a particular DPP4 polynucleotide sequence. Antisense oligonucleotides can be modified 
without affecting their ability to hybridize to a DPP4 polynucleotide. These 
modifications can be internal or at one or both ends of the antisense molecule. For 
example, internucleoside phosphate linkages can be modified by adding cholesteryl or 

10 diamine moieties with varying numbers of carbon residues between the amino groups 
and terminal ribose. Modified bases and/or sugars, such as arabinose instead of ribose, 
or a 3', S'-substituted oligonucleotide in which the 3' hydroxyl group or the 5 1 phosphate 
group are substituted, also can be employed in a modified antisense oligonucleotide. 
These modified oligonucleotides can be prepared by methods well known in the art. 

15 

Ribozymes 

Ribozymes are RNA molecules with catalytic activity [Uhlmann, (1987)]. Ribozymes 
can be used to inhibit gene function by cleaving an RNA sequence, as is known in the 

20 art. The mechanism of ribozyme action involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by endonucleolytic 
cleavage.. Examples include engineered hammerhead motif ribozyme molecules that can 
specifically and efficiently catalyze endonucleolytic cleavage of specific nucleotide 
sequences. The coding sequence of a DPP4 polynucleotide can be used to generate 

25 ribozymes which will specifically bind to mRNA transcribed from a DPP4 
polynucleotide. Methods of designing and constructing ribozymes which can cleave 
other RNA molecules in trans in a highly sequence specific manner have been developed 
and described in the art. For example, the cleavage activity of ribozymes can be targeted 
to specific RNAs by engineering a discrete "hybridization" region into the ribozyme. 

30 The hybridization region contains a sequence complementary to the target RNA and thus 
specifically hybridizes with the target RNA. 
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Specific ribozyme cleavage . sites within a DPP4 RNA target can be identified by 
scanning the target molecule for ribozyme cleavage sites which include the following 
sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of between 
15 and 20 ribonucleotides corresponding to the region of the target RNA containing the 

5 cleavage site can be evaluated for secondary structural features which may render the 
target inoperable. Suitability of candidate DPP4 RNA targets also can be evaluated by 
testing accessibility to hybridization with complementary oligonucleotides using 
ribonuclease protection assays. The nucleotide sequences shown in SEQ ID NO: 1 and 
its complement provide sources of suitable hybridization region sequences. Longer 

10 complementary sequences can be used to increase the affinity of the hybridization 
sequence for the target. The hybridizing and cleavage regions of the ribozyme can be 
integrally related such that upon hybridizing to the target RNA through the 
complementary regions, the catalytic region of the ribozyme can cleave the target 

15 Ribozymes can be introduced into cells as part of a DNA construct. Mechanical 
methods, such as microinjection, liposome-mediated transfection, electroporation, or 
calcium phosphate precipitation, can be used to introduce a ribozyme-containing DNA 
construct into cells in which it is desired to decrease DPP4 expression. Alternatively, if 
it is desired that the cells stably retain the DNA construct, the construct can be supplied 

20 on a plasmid and maintained as a separate element or integrated into the genome of the 
cells, as is known in the art. A ribozyme-encoding DNA construct can include 
transcriptional regulatory elements, such as a promoter element, an enhancer or UAS 
element, and a transcriptional terminator signal, for controlling transcription of 
ribozymes in the cells (U.S. 5,641,673). Ribozymes also can be engineered to provide 

25 an additional level of regulation, so that destruction of mRNA occurs only when both a 
ribozyme and a target gene are induced in the cells. 

Screening / Screening Assays 

30 Regulators 

Regulators as used herein, refer to compounds that affect the activity of DPP4 in vivo 
and/or in vitro. Regulators can be agonists and antagonists of DPP4 polypeptide and can 
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be c ompounds t hat e xert t heir e ffect o n t he D PP4 a ctivity v ia t he e nzymatic a cti vity, 
expression, post-translational modifications or by other means. Agonists of DPP4 are 
molecules which, when bound to DPP4, increase or prolong the activity of DPP4. 
Agonists of DPP4 include proteins, nucleic acids, carbohydrates, small molecules, or 
5 any other molecule which activate DPP4. Antagonists of DPP4 are molecules which, 
when bound to DPP4, decrease the amount or the duration of the activity of DPP4. 
Antagonists include proteins, nucleic acids, carbohydrates, antibodies, small molecules, 
or any other molecule which decrease the activity of DPP4. 

10 The term "modulate", as it appears herein, refers to a change in the activity of DPP4 
polypeptide. For example, modulation may cause an increase or a decrease in enzymatic 
activity, binding characteristics, or any other biological, functional, or immunological 
properties ofDPP4. 

15 As used herein, the terms "specific binding" or "specifically binding" refer to that 
interaction between a protein or peptide and an agonist, an antibody, or an antagonist. 
The interaction is dependent upon the presence of a particular structure of the protein 
recognized by the binding molecule (i.e., the antigenic determinant or epitope). For 
example, if an antibody is specific for epitope "A" the presence of a polypeptide 

20 containing the epitope A, or the presence of free unlabeled A, in a reaction containing 
free labeled A and the antibody will reduce the amount of labeled A that binds to the 
antibody. 

The invention provides methods (also referred to herein as "screening assays") for 
25 identifying compounds which can be used for the treatment of diseases related to DPP4. 
The methods entail the identification of candidate or test compounds or agents (e.g., 
peptides, peptidomimetics, small molecules or other molecules) which bind to DPP4 
and/or have a stimulatory or inhibitory effect on the biological activity of DPP4 or its 
expression and then determining which of these compounds have an effect on symptoms 
30 or diseases related to DPP4 in an in vivo assay. 
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Candidate or test compounds or agents which bind to DPP4 and/or have a stimulatory or 
inhibitory effect on the activity or the expression of DPP4 are identified either in assays 
that employ cells which express DPP4 (cell-based assays) or in assays with isolated 
DPP4 (cell-free assays). The various assays can employ a variety of variants of DPP4 
5 (e.g., fiill-length DPP4, a biologically active fragment of DPP4, or a fusion protein 
which includes all or a portion of DPP4). Moreover, DPP4 can be derived from any 
suitable mammalian species (e.g., human DPP4, rat DPP4 or murine DPP4). The assay 
can be a binding assay entailing direct or indirect measurement of the binding of a test 
compound or a known DPP4 ligand to DPP4. The assay can also be an activity assay 
10 entailing direct or indirect measurement of the activity of DPP4. The assay can also be 
an expression assay entailing direct or indirect measurement of the expression of DPP4 
mRNA or DPP4 protein. The various screening assays are combined with an in vivo 
assay entailing measuring the effect of the test compound on the symptoms of diseases 
related to DPP4. 

15 

The present invention includes biochemical, cell free assays that allow the identification 
of inhibitors a nd a gonists o f p roteases s uitable a s 1 ead s tructures for p harmacological 
drug development. Such assays involve contacting a form of DPP4 (e.g., full-length 
DPP4, a biologically active fragment of DPP4, or a fiision protein comprising all or a 
20 portion of DPP4) with a test compound and determining the ability of the test compound 
to act as an antagonist (preferably) or an agonist of the enzymatic activity of DPP4. 

The activity of DPP4 molecules of the present invention can be measured using a variety 
of assays that measure DPP4 activity. For example, DPP4 enzyme activity can be 

25 assessed by a standard in vitro serine/metallo/... protease assay (see, for example, [U.S. 
5,057,414]). Those of skill in the art are aware of a variety of substrates suitable for in 
vitro assays, such as SucAla-Ala-Pro-Phe-pNA, fluorescein mono-p-guanidinobenzoate 
hydrochloride, benzyloxycarbonyl-L-Arginyl-S-benzylester, Nalpha-Benzoyl-L-arginine 
ethyl ester hydrochloride, and the like. In addition, protease assay kits available from 

30 commercial sources, such as Calbiochem™ (San Diego, Calif). For general references, 
see Barrett (Ed.), Methods in Enzymology, Proteolytic Enzymes: Serine and Cysteine 
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Peptidases (Academic Press Inc. 1994), and Barrett et al., (Eds.), Handbook of 
Proteolytic Enzymes (Academic Press Inc. 1998). 

Solution in vitro assays can be used to identify a DPP4 substrate or inhibitor. Solid phase 
5 systems can also be used to identify a substrate or inhibitor of a DPP4 polypeptide. For 
example, a DPP4 polypeptide or DPP4 fusion protein can be immobilized onto the 
surface of a receptor chip of a commercially available biosensor instrument (BIACORE, 
Biacore AB; Uppsala, Sweden). The use of this instrument is disclosed, for example, by 
[Karlsson, (1991), and Cunningham and Wells, (1993)]. 

10 

In brief, a DPP4 polypeptide o r fusion p rotein i s c ovalently a ttached, u sing a mine o r 
sulfhydryl chemistry, to dextran fibers that are attached to gold film within a flow cell. A 
test sample is then passed through the cell. If a DPP4 substrate or inhibitor is present in 
the sample, it will bind to the immobilized polypeptide or fusion protein, causing a 

15 change in the refractive index of the medium, which is detected as a change in surface 
plasmon resonance of the gold film. This system allows the determination on- and off- 
rates, from which binding affinity can be calculated, and assessment of the stoichiometry 
of binding, as well as the kinetic effects of DPP4 mutation. This system can also be used 
to examine antibody-antigen interactions, and the interactions of other complement/anti- 

20 complement pairs. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of DPP4. Such assays can employ 
full-length DPP4, a biologically active fragment of DPP4, or a fusion protein which 
25 includes all or a portion of DPP4. As described in greater detail below, the test 
compound can be obtained by any suitable means, e.g., from conventional compound 
libraries. 

Determining the ability of the test compound to modulate the activity of DPP4 can be 
30 accomplished, f or e xample, b y d etermining t he a bility o f D PP4 1 o b ind t o o r i nteract 
with a target molecule. The target molecule can be a molecule with which DPP4 binds 
or interacts with in nature. The target molecule can be a component of a signal 
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transduction pathway which facilitates transduction of an extracellular signal. The target 
DPP4 molecule can be, for example, a second intracellular protein which has catalytic 
activity or a protein which facilitates the association of downstream signaling molecules 
with DPP4. 

5 

Determining the ability of DPP4 to bind to or interact with a target molecule can be 
accomplished by one of the methods described above for determining direct binding. In 
one embodiment, determining the ability of a polypeptide of the invention to bind to or 
interact with a target molecule can be accomplished by determining the activity of the 

10 target molecule. For example, the activity of the target molecule can be determined by 
detecting induction of a cellular second messenger of the target (e.g., intracellular Ca 2+ , 
diacylglycerol, EP3, etc.), detecting catalytic/enzymatic activity of the target on an 
appropriate substrate, detecting the induction of a reporter gene (e.g., a regulatory 
element that is responsive to a polypeptide of the invention operably linked to a nucleic 

1 5 acid encoding a detectable marker, e.g., luciferase), or detecting a cellular response. 

In various embodiments of the above assay methods of the present invention, it may be 
desirable to immobilize DPP4 (or a DPP4 target molecule) to facilitate separation of 
complexed from uncomplexed forms of one or both of the proteins, as well as to 

20 accommodate automation of the assay. Binding of a test compound to DPP4, or 
interaction of DPP4 with a target molecule in the presence and absence of a candidate 
compound, can be accomplished in any vessel suitable for containing the reactants. 
Examples of such vessels include microtitre plates, test tubes, and micro-centrifuge 
tubes. In one embodiment, a fusion protein can be provided which adds a domain that 

25 allows one or both of the proteins to be bound to a matrix. For example, glutathione-S- 
transferase (GST) fusion proteins or glutathione-S-transferase fusion proteins can be 
adsorbed onto glutathione sepharose beads (Sigma Chemical; St. Louis, Mo.) or 
glutathione derivatized microtitre plates, which are then combined with the test 
compound or the test compound and either the non-adsorbed target protein or DPP4, and 

30 the mixture incubated under conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH). Following incubation, the beads or microtitre 
plate wells are washed to remove any unbound components and complex formation is 
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measured either directly or indirectly, for example, as described above. Alternatively, 
the complexes can be dissociated from the matrix, and the level of binding or activity of 
DPP4 can be determined using standard techniques. 

5 Other techniques for immobilizing proteins on matrices can also be used in the screening 
assays of the invention. For example, either DPP4 or its target molecule can be 
immobilized utilizing conjugation of biotin and streptavidin. Biotinylated polypeptide 
of the invention or target molecules can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques well known in the art (e.g., biotinylation kit, Pierce 

10 Chemicals; Rockford, 111.), and i mmobilized i n t he wells of streptavidin-coated plates 
(Pierce Chemical). Alternatively, antibodies reactive with DPP4 or target molecules but 
which do not interfere with binding of the polypeptide of the invention to its target 
molecule can be derivatized to the wells of the plate, and unbound target or polypeptide 
of the invention trapped in the wells by antibody conjugation. Methods for detecting 

15 such complexes, in addition to those described above for the GST-immobilized 
complexes, include immunodetection of complexes using antibodies reactive with DPP4 
or target molecule, as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with DPP4 or target molecule. 

20 Another technique for drug screening which may be used provides for high throughput 
screening of compounds having suitable binding affinity to the protein of interest as 
described in published PCT application WO84/03564. In this method, targe numbers of 
different small test compounds are synthesized on a solid substrate, such as plastic pins 
or some other surface. The test compounds are reacted with DPP4, or fragments thereof, 

25 and washed. Bound DPP4 is then detected by methods well known in the art. Purified 
DPP4 can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture 
the peptide and immobilize it on a solid support. 

30 In another embodiment, one may use competitive drug screening assays in which 
neutralizing antibodies capable of binding DPP4 specifically compete with a 
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testcompound for binding DPP4. In this manner, antibodies can be used to detect the 
presence of any peptide which shares one or more antigenic determinants with DPP4. 

The screening assay can also involve monitoring the expression of DPP4. For example, 
5 regulators of expression of DPP4 can be identified in a method in which a cell is 
contacted with a candidate compound and the expression of DPP4 protein or mRNA in 
the cell is determined. The level of expression of DPP4 protein or mRNA the presence 
of the candidate compound is compared to the level of expression of DPP4 protein or 
mRNA in the absence of the candidate compound. The candidate compound can then be 
10 identified as a regulator of expression of DPP4 based on this comparison. For example, 
when expression of DPP4 protein or mRNA protein is greater (statistically significantly 
greater) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as a stimulator of DPP4 protein or mRNA expression. 
Alternatively, when expression of DPP4 protein or mRNA is less (statistically 
1 5 significantly 1 ess) i n t he p resence o f t he c andidate c ompound than in its absence, the 
candidate compound is identified as an inhibitor of DPP4 protein or mRNA expression. 
The level of DPP4 protein or mRNA expression in the cells can be determined by 
methods described below. 

20 Binding Assays 

For binding assays, the test compound is preferably a small molecule which binds to and 
occupies the active site of DPP4 polypeptide, thereby making the ligand binding site 
inaccessible to substrate such that normal biological activity is prevented. Examples of 
25 such small molecules include, but are not limited to, small peptides or peptide-like 
molecules. Potential ligands which bind to a polypeptide of the invention include, but 
are not limited to, the natural ligands of known DPP4 proteases and analogues or 
derivatives thereof. 

30 In binding assays, either the test compound or the DPP4 polypeptide can comprise a 
detectable label, such as a fluorescent, radioisotopic, chemiluminescent, or enzymatic 
label, such as horseradish peroxidase, alkaline phosphatase, or luciferase. Detection of a 
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test compound which is bound to DPP4 polypeptide can then be accomplished, for 
example, by direct counting of radioemmission, by scintillation counting, or by 
determining conversion of an appropriate substrate to a detectable product 
Alternatively, binding of a test compound to a DPP4 polypeptide can be determined 

5 without labeling either of the interactants. For example, a microphysiometer can be 
used to detect binding of a test compound with a DPP4 polypeptide. A 
microphysiometer (eg., Cytosensor™) is an analytical instrument that measures the rate 
at which a cell acidifies its environment using a light-addressable potentiometric sensor 
(LAPS). Changes in this acidification rate can be used as an indicator of the interaction 

10 between a test compound and DPP4 [Haseloff, (1988)]. 

Determining the ability of a test compound to bind to DPP4 also can be accomplished 
using a technology such as real-time Bimolecular Interaction Analysis (BIA) 
[McConnell, (1992); Sjolander, (1991)]. BIA is a technology for studying biospecific 
15 interactions in real time, without labeling any of the interactants (eg., BIAcore™). 
Changes in the optical phenomenon surface plasmon resonance (SPR) can be used as an 
indication of real-time reactions between biological molecules. 

In yet another aspect of the invention, a DPP4-like polypeptide can be used as a "bait 
20 protein" in a two-hybrid assay or three-hybrid assay [Szabo, (1995); U.S. 5,283,317), to 
identify other proteins which bind to or interact with DPP4 and modulate its activity. 

The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay 

25 utilizes two different DNA constructs. For example, in one construct, polynucleotide 
encoding DPP4 can be fused to a polynucleotide encoding the DNA binding domain of a 
known transcription factor (eg., GAL-4). In the other construct a DNA sequence that 
encodes an unidentified protein ("prey" or "sample") can be fused to a polynucleotide 
that codes for the activation domain of (he known transcription factor. If the "bait" and 

30 the "prey" proteins are able to interact in vivo to form an protein-dependent complex, the 
DNA-binding and activation domains of the transcription factor are brought into close 
proximity. This proximity allows transcription of a reporter gene (e.g., LacZ), which is 
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operably linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected, and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the DNA sequence 
encoding the protein which interacts with DPP4. 

5 

It may be desirable to immobilize either the DPP4 (or polynucleotide) or the test 
compound to facilitate separation of the bound form from unbound forms of one or both 
of the interactants, as well as to accommodate automation of the assay. Thus, either the 
DPP4-like polypeptide (or polynucleotide) or the test compound can be bound to a solid 

10 support. Suitable solid supports include, but are not limited to, glass or plastic slides, 
tissue culture plates, microtiter wells, tubes, silicon chips, or particles such as beads 
(including, but not limited to, latex, polystyrene, or glass beads). Any method known in 
the art can be used to attach DPP4-like polypeptide (or polynucleotide) or test 
compound to a solid support, including use of covalent and non-covalent linkages, 

15 passive absorption, or pairs of binding moieties attached respectively to the polypeptide 
(or polynucleotide) or test compound and the solid support. Test compounds are 
preferably bound to the solid support in an array, so that the location of individual test 
compounds can be tracked. Binding of a test compound to DPP4 (or a polynucleotide 
encoding for DPP4) can be accomplished in any vessel suitable for containing the 

20 reactants. Examples of such vessels include microtiter plates, test tubes, and 
microcentrifuge tubes. 

In one embodiment, DPP4 is a fusion protein comprising a domain that allows binding 
of DPP4 to a solid support. For example, glutathione-S-transferase fusion proteins can 

25 be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, Mo.) or 
glutathione derivatized microtiter plates, which are then combined with the test 
compound or the test compound and the non-adsorbed DPP4; the mixture is then 
incubated under conditions conducive to complex formation (e.g., at physiological 
conditions for salt and pH). Following incubation, the beads or microtiter plate wells are 

30 washed to remove any unbound components. Binding of the interactants can be 
determined either directly or indirectly, as described above. Alternatively, the 
complexes can be dissociated from the solid support before binding is determined. 
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Other techniques for immobilizing proteins or polynucleotides on a solid support also 
can be used in the screening assays of the invention. For example, either DPP4 (or a 
polynucleotide encoding DPP4) or a test compound can be immobilized utilizing 

5 conjugation of biotin and streptavidin. Biotinylated DPP4 (or a polynucleotide encoding 
biotinylated DPP4) or test compounds can be prepared from biotin-NHS (N- 
hydroxysuccinimide) using techniques well known in the art (e.g., biotinylation kit, 
Pierce Chemicals, Rockford, 111.) and immobilized in the wells of streptavidin-coated 
plates (Pierce Chemical). Alternatively, antibodies which specifically bind to DPP4, 

10 polynucleotide, or a test compound, but which do not interfere with a desired binding 
site, such as the active site of DPP4, can be derivatized to the wells of the plate. 
Unbound target or protein can be trapped in the wells by antibody conjugation. 

Methods for detecting such complexes, in addition to those described above for the 
15 GST-immobilized complexes, include immunodetection of complexes using antibodies 
which specifically bind to DPP4 polypeptide or test compound, enzyme-linked assays 
which rely on detecting an activity of DPP4 polypeptide, and SDS gel electrophoresis 
under non-reducing conditions. 

20 Screening for test compounds which bind to a DPP4 polypeptide or polynucleotide also 
can be carried out in an intact cell. Any cell which comprises a DPP4 polypeptide or 
polynucleotide can be used in a cell-based assay system. A DPP4 polynucleotide can be 
naturally occurring in the cell or can be introduced using techniques such as those 
described above. Binding of the test compound to DPP4 or a polynucleotide encoding 

25 DPP4 is determined as described above. 

Functional Assays 

Test compounds can be tested for the ability to increase or decrease DPP4 activity of a 
30 DPP4 p olypeptide. T he D PP4 a ctivity c an b e m easured, for example, using methods 
described in the specific examples, below. DPP4 activity can be measured after 
contacting either a purified DPP4 or an intact cell with a test compound. A test 
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compound which decreases DPP4 activity by at least about 10, preferably about 50, 
more preferably about 75, 90, or 100% is identified as a potential agent for decreasing 
DPP4 activity. A test compound which increases DPP4 activity by at least about 10, 
preferably about 50, more preferably about 75, 90, or 100% is identified as a potential 
5 agent for increasing DPP4 activity. 

Gene Expression 

In another embodiment, test compounds which increase or decrease DPP4 gene 

10 expression are identified. As used herein, the term "correlates with expression of a poly- 
nucleotide" indicates that the detection of the presence of nucleic acids, the same or 
related to a nucleic acid sequence encoding DPP4, by northern analysis or realtime PCR 
is indicative of the presence of nucleic acids encoding DPP4 in a sample, and thereby 
correlates with expression of the transcript from the polynucleotide encoding DPP4. 

15 The term "microarray", as used herein, refers to an array of distinct polynucleotides or 
oligonucleotides arrayed on a substrate, such as paper, nylon or any other type of 
membrane, filter, chip, glass slide, or any other suitable solid support. A DPP4 
polynucleotide is contacted with a test compound, and the expression of an RNA or 
polypeptide product of DPP4 polynucleotide is determined. The level of expression of 

20 appropriate mKNA or polypeptide in the presence of the test compound is compared to 
the level of expression of mRNA or polypeptide in the absence of the test compound. 
The test compound can then be identified as a regulator of expression based on this 
comparison. For example, when expression of mRNA or polypeptide is greater in the 
presence of the test compound than in its absence, the test compound is identified as a 

25 stimulator or enhancer of the mRNA or polypeptide expression. A lternatively, when 
expression of the mRNA or polypeptide is less in the presence of the test compound than 
in its absence, the test compound is identified as an inhibitor of the mRNA or 
polypeptide expression. 

30 The level of DPP4 mRNA or polypeptide expression in the cells can be determined by 
methods well known in the art for detecting mRNA or polypeptide. Either qualitative or 
quantitative methods can be used. The presence of polypeptide products of DPP4 
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polynucleotide can be determined, for example, using a variety of techniques known in 
the art, including immunochemical methods such as radioimmunoassay, Western 
blotting, and immunohistochemistry. Alternatively, polypeptide synthesis can be 
determined in vivo, in a cell culture, or in an in vitro translation system by detecting 
5 incorporation of labelled amino acids into DPP4. 

Such screening can be carried out either in a cell-free assay system or in an intact cell. 
Any cell which expresses DPP4 polynucleotide can be used in a cell-based assay system. 
The DPP4 polynucleotide can b e n aturally o ccurring i n t he c ell o r c an b e i ntroduced 
10 using techniques such as those described above. Either a primary culture or an 
established cell line can be used. 

Test Compounds 

15 Suitable test compounds for use in the screening assays of the invention can be obtained 
from any suitable source, e.g., conventional compound libraries. The test compounds 
can also be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; spatially addressable parallel 
solid phase or solution phase libraries; synthetic library methods requiring 

20 deconvolution; the "one-bead one-compound" library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to peptide libraries, while the other four approaches are applicable to peptide, 
non-peptide oligomer or small molecule libraries of compounds [Lam, (1997)]. 
Examples of methods for the synthesis of molecular libraries can be found in the art. 

25 Libraries of compounds may be presented in solution or on beads, bacteria, spores, 
plasmids or phage. 

Modeling of Regulators 

30 Computer modeling and searching technologies permit identification of compounds, or 
the improvement of already identified compounds, that can modulate DPP4 expression 
or activity. Having identified such a compound or composition, the active sites or 
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regions are identified. Such sites might typically be the enzymatic active site, regulator 
binding sites, or ligand binding sites. The active site can be identified using methods 
known in the art including, for example, from the amino acid sequences of peptides, 
from the nucleotide sequences of nucleic acids, or from study of complexes of the 
5 relevant compound or composition with its natural ligand. In the latter case, chemical or 
X-ray crystallographic methods can be used to find the active site by finding where on 
the factor the complexed ligand is found. 

Next, the three dimensional geometric structure of the active site is determined. This 
10 can be done by known methods, including X-ray crystallography, which can determine a 
complete molecular structure. O n the other hand, solid or liquid phase NMR can be 
used to determine certain intramolecular distances. Any other experimental method of 
structure determination can be used to obtain partial or complete geometric, structures. 
The geometric structures may be measured with a complexed ligand, natural or artificial, 
1 5 which may increase the accuracy of the active site structure determined. 

If an incomplete or insufficiently accurate structure is determined, the methods of 
computer based numerical modeling can be used to complete the structure or improve its 
accuracy. Any recognized modeling method may be used, including parameterized 

20 models specific to particular biopolymers such as proteins or nucleic acids, molecular 
dynamics models based on computing molecular motions, statistical mechanics models 
based on thermal ensembles, or combined models. For most types of models, standard 
molecular force fields, representing the forces between constituent atoms and groups, 
are necessary, and can be selected from force fields known in physical chemistry. The 

25 incomplete or less accurate experimental structures can serve as constraints on the 
complete and more accurate structures computed by these modeling methods. 

Finally, having determined the structure of the active site, either experimentally, by 
modeling, or by a combination, candidate modulating compounds can be identified by 
30 searching d atabases c ontaining c ompounds a long w ith i nformation o n t heir m olecular 
structure. Such a search seeks compounds having structures that match the determined 
active site structure and that interact with the groups defining the active site. Such a 
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search can be manual, but is preferably computer assisted. These compounds found 
from this search are potential DPP4 modulating compounds. 

Alternatively, these methods can be used to identify improved modulating compounds 
5 from an already known modulating compound or ligand. The composition of the known 
compound can be modified and the structural effects of modification can be determined 
using the experimental and computer modeling methods described above applied to the 
new composition. The altered structure is then compared to the active site structure of 
the compound to determine if an improved fit or interaction results. In this manner 
10 systematic variations in composition, such as by varying side groups, can be quickly 
evaluated to obtain modified modulating compounds or ligands of improved specificity 
or activity. 

Therapeutic Indications and Methods 

15 

It was found by the present applicant that DPP4 is expressed in various human tissues. 
Cardiovascular Disorders 

20 Heart failure is defined as a pathophysiological state in which an abnormality of cardiac 
function is responsible for the failure of the heart to pump blood at a rate commensurate 
with the requirement of the metabolizing tissue. It includes all forms of pumping failures 
such as high-output and low-output, acute and chronic, right-sided or left-sided, systolic 
or diastolic, independent of the underlying cause. 

25 

Myocardial infarction (MI) is generally caused by an abrupt decrease in coronary blood 
flow that follows a thrombotic occlusion of a coronary artery previously narrowed by 
arteriosclerosis. MI prophylaxis (primary and secondary prevention) is included as well 
as the acute treatment of MI and the prevention of complications. 

30 

Ischemic diseases are conditions in which the coronary flow is restricted resulting in a 
perfusion which is inadequate to meet the myocardial requirement for oxygen. This 
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group of diseases includes stable angina, unstable angina and asymptomatic ischemia. 

Arrhythmias include all forms of atrial and ventricular tachyarrhythmias, atrial 
tachycardia, atrial flutter, atrial fibrillation, atrioventricular reentrant tachycardia, 
5 preexitation syndrome, ventricular tachycardia, ventricular flutter, ventricular 
fibrillation, as well as bradycardic forms of arrhythmias. 

Hypertensive vascular diseases include primary as well as all kinds of secondary arterial 
hypertension, renal, endocrine, neurogenic, others. The genes may be used as drug 
10 targets for the treatment of hypertension as well as for the prevention of all 
complications arising from cardiovascular diseases. 

Peripheral vascular diseases are defined as vascular diseases in which arterial and/or 
venous flow is reduced resulting in an imbalance between blood supply and tissue 
15 oxygen demand. It includes chronic peripheral arterial occlusive disease (PAOD), acute 
arterial thrombosis and embolism, inflammatory vascular disorders, Raynaud's 
phenomenon and venous disorders. 

Atherosclerosis is a cardiovascular disease in which the vessel wall is remodeled, com- 
20 promising the lumen of the vessel. The atherosclerotic remodeling process involves 
accumulation of cells, both smooth muscle cells and monocyte/macrophage 
inflammatory cells, in the intima of the vessel wall. These cells take up lipid, likely from 
the circulation, to form a mature atherosclerotic lesion. Although the formation of these 
lesions is a chronic process, occurring over decades of an adult human life, the majority 
25 of the morbidity associated with atherosclerosis occurs when a lesion ruptures, releasing 
thrombogenic debris that rapidly occludes the artery. When such an acute event occurs 
in the coronary artery, myocardial infarction can ensue, and in the worst case, can result 
in death. 

30 The formation of the atherosclerotic lesion can be considered to occur in five 
overlapping stages such as migration, lipid accumulation, recruitment of inflammatory 
cells, proliferation of vascular smooth muscle cells, and extracellular matrix deposition. 
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Each of these processes can be shown to occur in man and in animal models of 
atherosclerosis, but the relative contribution of each to the pathology and clinical 
significance of the lesion is unclear. 

5 Thus, a need exists for therapeutic methods and agents to treat cardiovascular 
pathologies, such as atherosclerosis and other conditions related to coronary artery 
disease. 

Cardiovascular diseases include but are not limited to disorders of the heart and the 
10 vascular system like congestive heart failure, myocardial infarction, ischemic diseases 
of the heart, all kinds of atrial and ventricular arrhythmias, hypertensive vascular 
diseases, peripheral vascular diseases, and atherosclerosis. 

Too high or too low levels of fats in the bloodstream, especially cholesterol, can cause 
15 long-term problems. The risk to develop atherosclerosis and coronary artery or carotid 
arteiy disease (and thus the risk of having a heart attack or stroke) increases with the 
total cholesterol level increasing. Nevertheless, extremely low cholesterol levels may not 
be healthy. Examples of disorders of lipid metabolism are hyperlipidemia (abnormally 
high levels of fats (cholesterol, triglycerides, or both) in the blood, may be caused by 
20 family history of hyperlipidemia), obesity, a high-fat diet, lack of exercise, moderate to 
high alcohol consumption^ cigarette smoking, poorly controlled diabetes, and an 
underactive thyroid gland), hereditary hyperlipidemias (type I hyperlipoproteinemia 
(familial hyperchylomicronemia), type II hyperlipoproteinemia (familial 
hypercholesterolemia), type III hyperlipoproteinemia, type IV hyperlipoproteinemia, or 
25 type V hyperlipoproteinemia), hypolipoproteinemia, lipidoses (caused by abnormalities 
in the enzymes that metabolize fats), Gaucher's disease, Niemann-Pick disease, Fabry's 
disease, Wolman's disease, cerebrotendinous xanthomatosis, sitosterolemia, Refsum's 
disease, or Tay-Sachs disease. 

30 Kidney disorders may lead to hypertension or hypotension. Examples for kidney 
problems possibly leading to hypertension are renal artery stenosis, pyelonephritis, 
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glomerulonephritis, kidney tumors, polycistic kidney disease, injury to the kidney, or 
radiation therapy affecting the kidney. Excessive urination may lead to hypotension. 

The human DPP4 is highly expressed in the following cardiovascular related tissues: 
5 fetal heart, heart, interventricular septum, pericardium, heart atrium (right), coronary 
artery smooth muscle primaiy cells. Expression in the above mentioned tissues 
demonstrates that the human DPP4 or mRNA can be. utilized to diagnose of 
cardiovascular diseases. Additionally the activity of the human DPP4 can be modulated 
to treat cardiovascular diseases. 

10 

Hematological Disorders 

Hematological disorders comprise diseases of the blood and all its constituents as well 
as diseases of organs and tissues involved in the generation or degradation of all the 
15 constituents of the blood. They include but are not limited to 1) Anemias, 2) 
Myeloproliferative Disorders, 3) Hemorrhagic Disorders, 4) Leukopenia, 5) Eosinophilic 
Disorders, 6) Leukemias, 7) Lymphomas, 8) Plasma Cell Dyscrasias, 9) Disorders of the 
Spleen in the course of hematological disorders. Disorders according to 1) include, but 
are not limited to anemias due to defective or deficient hem synthesis, deficient 
20 erythropoiesis. Disorders according to 2) include, but are not limited to polycythemia 
vera, tumor-associated erythrocytosis, myelofibrosis, thrombocythemia. Disorders 
according to 3) include, but are not limited to vasculitis, thrombocytopenia, heparin- 
induced thrombocytopenia, thrombotic thrombocytopenic purpura, hemolytic-uremic 
syndrome, hereditary and acquired disorders of platelet function, hereditary coagulation 
25 disorders. Disorders according to 4) include, but are not limited to neutropenia, 
lymphocytopenia. Disorders according to 5) include, but are not limited to 
hypereosinophilia, idiopathic hypereosinophilic syndrome. Disorders according to 6) 
include, but are not limited to acute myeloic leukemia, acute lymphoblastic leukemia, 
chronic myelocytic leukemia, chronic lymphocytic leukemia, myelodysplastic 
30 syndrome. Disorders according to 7) include, but are not limited to Hodgkin's disease, 
non-Hodgkin's lymphoma, Burkitt's lymphoma, mycosis fungoides cutaneous T-cell 
lymphoma. Disorders according to 8) include, but are not limited to multiple myeloma, 
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macroglobulinemia, heavy chain diseases. In extension of the preceding idiopathic 
thrombocytopenic purpura, iron deficiency anemia, megaloblastic anemia (vitamin B12 
deficiency), aplastic anemia, thalassemia, malignant lymphoma bone marrow invasion, 
malignant lymphoma skin invasion, hemolytic uremic syndrome, giant platelet disease 
5 are considered to be hematological diseases too. 

The human DPP4 is highly expressed in the following tissues of the hematological 
system: leukocytes (peripheral blood), bone marrow stromal cells, spleen and 
neutrophils cord blood. The expression in the above mentioned tissues demonstrates 
10 that the human DPP4 or mRNA can be utilized to diagnose of hematological diseases. 
Additionally the activity of the human DPP4 can be modulated to treat hematological 
disorders. 

Gastrointestinal and Liver Diseases 

15 

Gastrointestinal diseases comprise primary or secondary, acute or chronic diseases of the 
organs of the gastrointestinal tract which may be acquired or inherited, benign or 
malignant or metaplastic, and which may affect the organs of the gastrointestinal tract or 
the body as a whole. They comprise but are not limited to 1) disorders of the esophagus 
20 like achalasia, vigoruos achalasia, dysphagia, cricopharyngeal incoordination, pre- 
esophageal dysphagia, diffuse esophageal spasm, globus sensation, Barrett's metaplasia, 
gastroesophageal reflux, 2) disorders of the stomach and duodenum like functional 
dyspepsia, inflammation of the gastric mucosa, gastritis, stress gastritis, chronic erosive 
gastritis, atrophy of gastric glands, metaplasia of gastric tissues, gastric ulcers, duodenal 
25 ulcers, neoplasms of the stomach, 3) disorders of the pancreas like acute or chronic 
pancreatitis, insufficiency of the exocrinic or endocrinic tissues of the pancreas like 
steatorrhea, diabetes, neoplasms of the exocrine or endocrine pancreas like 3.1) multiple 
endocrine neoplasia syndrome, ductal adenocarcinoma, cystadenocarcinoma, islet cell 
tumors, insulinoma, gastrinoma, carcinoid tumors, glucagonoma, Zollinger-EUison 
30 syndrome, Vipoma syndrome, malabsorption syndrome, 4) disorders of the bowel like 
chronic inflammatory diseases of the bowel, Crohn's disease, ileus, diarrhea and 
constipation, colonic inertia, megacolon, malabsorption syndrome, ulcerative colitis, 
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4.1) functional bowel disorders like irritable bowel syndrome, 4.2) neoplasms of the 
bowel like familial polyposis, adenocarcinoma, primary malignant lymphoma, carcinoid 
tumors, Kaposi's sarcoma, polyps, cancer of the colon and rectum. 

5 Liver diseases comprise primary or secondary, acute or chronic diseases or injury of the 
liver which may be acquired or inherited, benign or malignant, and which may affect the 
liver or the body as a whole. They comprise but are not limited to disorders of the 
bilirubin metabolism, jaundice, syndroms of Gilbert's, Crigler-Najjar, Dubin- Johnson 
and Rotor; intrahepatic cholestasis, hepatomegaly, portal hypertension, ascites, Budd- 

10 Chiari syndrome, portal-systemic encephalopathy, fatty liver, steatosis, Reye's 
syndrome, liver diseases due to alcohol, alcoholic hepatitis or cirrhosis, fibrosis and 
cirrhosis, fibrosis and cirrhosis of the liver due to inborn errors of metabolism or 
exogenous substances, storage diseases, syndromes of Gaucher' s, Zellweger's, Wilson's 
- disease, acute or chronic hepatitis, viral hepatitis and its variants, inflammatory 

15 conditions of the liver due to viruses, bacteria, fungi, protozoa, helminths; drug induced 
disorders of the liver, chronic liver diseases like primary sclerosing cholangitis, alpha r 
antitrypsin-deficiency, primary biliary cirrhosis, postoperative liver disorders like 
postoperative intrahepatic cholestasis, hepatic granulomas, vascular liver disorders 
associated with systemic disease, benign or malignant neoplasms of the liver, 

20 disturbance of liver metabolism in the new-born or prematurely born. 

The human DPP4 is highly expressed in the following tissues of the gastroenterological 
system: stomach tumor, small intestine, ileum, rectum, salivary gland, liver, liver tumor, 
HEP G2 cells. The expression in the above mentioned tissues and in particular the 
25 differential expression between diseased tissue stomach tumor and healthy tissue 
stomach, between diseased tissue liver tumor and healthy tissue liver, between diseased 
tissue HEP G2 cells and healthy tissue liver demonstrates that the human DPP4 or 
mRNA can be utilized to diagnose of gastroenterological disorders. Additionally the 
activity of the human DPP4 can be modulated to treat gastroenterological disorders. 



30 
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Dermatologic Disorders 

The skin serves several functions. It's an multi-layered organ system that huilds an 
effective protective cover and regulates body temperature, senses painful and pleasant 
5 stimuli, keeps substances from entering the body, and provides a shield from the sun's 
harmful e fleets. S kin c olor, t exture, a nd f olds h elp m ark p eople a s individuals. Thus, 
skin disorders or diseases often have important consequences for physical and mental 
health. Skin disorders include, but are not limited to the conditions described in the 
following. 

10 

Itching (pruritus) is a sensation that instinctively demands scratching, which may be 
caused by a skin condition or a systemic diseas. 

Superficial Skin Disorders affect the uppermost layer of the skin, the stratum corneum or 
15 the keratin layer, and it consists of many layers of flattened, dead cells and acts as a 
barrier to protect the underlying tissue from injury and infection. Disorders of the 
superficial skin layers involve the stratum corneum and deeper layers of the epidermis. 

Examples of superficial skin disorders are provided in the following. 

20 

Dry skin often occurs in people past middle age, severe dry skin (ichthyosis) results from 
an inherited scaling disease, such as ichthyosis vulgaris or epidermolytic hyperkeratosis. 
Ichthyosis also results from nonhereditary disorders, such as leprosy, underactive 
thyroid, lymphoma, AIDS, and sarcoidosis. 

25 

Keratosis pilaris is a common disorder in which dead cells shed from the upper layer of 
skin and form plugs that fill the openings of hair follicles. 



30 



A callus is an area on the stratum corneum or keratin layer, that becomes abnormally 
thick in response to repeated rubbing. 

A corn is a pea-sized, thickened area of keratin that occurs on the feet. 
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Psoriasis is a chronic, recurring disease recognizable by silvery scaling bumps and 
various-sized plaques (raised patches). An abnormally high rate of growth and turnover 
of skin cells causes the scaling. 

5 Pityriasis rosea is a mild disease that causes scaly, rose-colored, inflamed skin. Pityriasis 
rosea is possibly caused by an infectious agent, although none has been identified. 

Lichen planus, a recurring itchy disease, starts as a rash of small discrete bumps that 
then combine and become rough, scaly plaques (raised patches). 

Dermatitis (eczema) is an inflammation of the upper layers of the skin, causing blisters, 
redness, swelling, oozing, scabbing, scaling, and usually itching. 

Forms of dermatitis are contact dermatitis, or chronic dermatitis of the hands and feet, 
15 e.g. Pompholyx. 

Further examples of dermatitic disorders are atopic dermatitis, seborrheic dermatitis, 
nummular dermatitis, generalized exfoliative dermatitis, stasis dermatitis, or localized 
scratch dermatitis (lichen simplex chronicus, neurodermatitis). 

20 

Other skin disorders are caused by inflammation. The skin can break out in a variety of 
rashes, sores, and blisters. Some skin eruptions can even be life threatening. 

Drug rashes are side effects of medications, mainly allergic reactions to medications. 

25 

Toxic epidermal necrolysis is a life-threatening skin disease in which the top layer of the 
skin peels off in sheets. This condition can be caused by a reaction to a drug, or by some 
other serious disease. 

30 Erythema multiforme, often caused by herpes simplex is a disorder characterized by 
patches of red, raised skin that often look like targets and usually are distributed 
symmetrically over the body. 
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Erythema nodosum is an inflammatory disorder that produces tender red bumps 
(nodules) under the skin, most often over the shins but occasionally on the arms and 
other areas. 

5 

Granuloma annulare is a chronic skin condition of unknown cause in which small, firm, 
raised bumps form a ring with normal or slightly sunken skin in the center. 

Some skin disorders are characterized as blistering diseases. Three autoimmune 
10 diseases-pemphigus, bullous pemphigoid, and dermatitis herpetiformis-are among the 
most serious. 

Pemphigus is an uncommon, sometimes fatal, disease in which blisters (bullae) of 
varying sizes break out on the skin, the lining of the mouth, and other mucous 
15 membranes. 

Bullous pemphigoid is an autoimmune disease that causes blistering. 

Dermatitis herpetiformis is an autoimmune disease in which clusters of intensely itchy, 
20 small blisters and hive-like swellings break out and persist. In people with the disease, 
proteins in wheat, rye, barley, and oat products activate the immune system, which 
attacks parts of the skin and somehow causes the rash and itching. 

Sweating disorders also belong to skin disorders. 

25 

Prickly heat is an itchy skin rash caused by trapped sweat. 

Excessive sweating (hyperhidrosis) may affect the entire surface of the skin, but often 
it's limited to the palms, soles, aimpits, or groin. The affected area is often pink or bluish 
30 white, and in severe cases the skin may be cracked, scaly, and soft, especially on the 
feet. 
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Skin disorders can affect the sebaceous glands. The sebaceous glands, which secrete oil 
onto the skin, lie in the dermis, the skin layer just below the surface layer (epidermis). 
Sebaceous gland disorders include acne, rosacea, perioral dermatitis, and sebaceous 
cysts. 

5 

Acne is a common skin condition in which the skin pores become clogged, leading to 
pimples and inflamed, infected abscesses (collections of pus). Acne tends to develop in 
teenagers. 

1 0 Acne is further subdivided in superficial acne or deep acne. 

Rosacea is a persistent skin disorder that produces redness, tiny pimples, and broken 
blood vessels, usually on the central area of the face. 

15 Perioral dermatitis is a red, often bumpy rash around the mouth and on the chin. 

A sebaceous cyst (keratinous cyst) is a slow-growing bump containing dead skin, skin 
excretions, and other skin particles. These cysts may be small and can appear anywhere. 

20 Hair Disorders also are skin disorders. Hair disorders include excessive hairiness, 
baldness, and ingrown beard hairs. 

The skin can be infected by bacteria. Bacterial skin infections can range in seriousness 
from minor acne to a life-threatening condition, such as staphylococcal scalded skin 
25 syndrome. The most common bacterial skin infections are caused by Staphylococcus 
and Streptococcus. Risk factors for skin infections are for example diabetes, AIDS or 
skin leasons. 

Impetigo is a skin infection, caused by Staphylococcus or Streptococcus, leading to the 
30 formation of small pus-filled blisters (pustules). 
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Folliculitis is an inflammation of the hair follicles caused by infection with 
Staphylococcus. The infection damages the hairs, which can be easily pulled out. 

Boils (furuncles) are large, tender, swollen, raised areas caused by staphylococcal 
5 infection around hair follicles. 

Carbuncles are clusters of boils that result in extensive sloughing of skin and scar 
formation. Carbuncles develop and heal more slowly than single boils and may lead to 
fever and fatigue. 

10 

Erysipelas is a skin infection caused by Streptococcus. A shiny, red, slightly swollen, 
tender rash develops, often with small blisters. Lymph nodes around the infected area 
may become enlarged and painful. 

15 Cellulitis is a spreading infection in, and sometimes beneath, the deep layers of the skin. 
Cellulitis most often results from a streptococcal infection or a staphylococcal infection. 
However, many other bacteria can also cause cellulitis. 

Paronychia is an infection around the edge of a fingernail or toenail. Paronychia can be 
20 caused by many different bacteria, including Pseudomonas and Proteus, and by fungi, 
such as Candida. 

Staphylococcal scalded skin syndrome is a widespread skin infection that can lead to 
toxic shock syndrome, in which the skin peels off as though burned. Certain types of 
25 staphylococci produce a toxic substance that causes the top layer of skin (epidermis) to 
split from the rest of the skin. 

Erythrasma is an infection of the top layers of the skin by the bacterium 
Corynebacterium minutissimum. 

30 

Skin infections are often caused by fungi. Fungi that infect the skin (dermatophytes) live 
only in the dead, topmost layer (stratum coraeum) and don't penetrate deeper. Some 
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fungal infections cause no symptoms or produce only a small amount of irritation, 
scaling, and redness. Other fungal infections cause itching, swelling, blisters, and severe 
scaling. 

5 Ringworm is a fungal skin infection caused by several different fungi and generally 
classified by its location on the body. 

Examples are Athlete's foot (foot ringworm, caused by either Trichophyton or 
Epidermophyton), jock itch (groin ringworm, can be caused by a variety of fungi and 
10 yeasts), scalp ringworm, caused by Trichophyton or Microsporum), nail ringworm and 
body ringworm (caused by Trichophyton). 

Candidiasis (yeast infection, moniliasis) is an infection by the yeast Candida. Candida 
usually infects the skin and mucous membranes, such a s the lining of the mouth and 

15 vagina. Rarely, it invades deeper tissues as well as the blood, causing life-threatening 
systemic candidiasis. The following types of Candida infections can be distinguished: 
Infections in skinfolds (intertriginous infections), vaginal and penile Candida infections 
(vulvovaginitis), thrush, Perleche (candida infection at the corners of the mouth), 
candidal paronychia (candida growing in the nail beds, produces painful swelling and 

20 pus). 

Tinea versicolor is a fungal infection that causes white to light brown patches on the 
skin. 

25 The skin can also be affected by parasites, mainly tiny insects or worms. 

Scabies is a mite infestation that produces tiny reddish pimples and severe itching. 
Scabies is caused by the itch mite Sarcoptes scabiei. 

30 Lice infestation (pediculosis) causes intense itching and can affect almost any area of the 
skin. Head lice and pubic lice are two different species. 
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Creeping eruption (cutaneous larva migrans) is a hookworm infection transmitted from 
warm, moist soil to exposed skin. The infection is caused by a hookworm that normally 
inhabits dogs and cats. 

5 Many types of viruses invade the skin. The medically important once cause warts and 
cold sores (fever blisters) on the lip. Warts are caused by the papillomavirus, and cold 
sores are c aused b y t he h erpes s implex v irus. Another i mportant g roup o f v iruses t hat 
infect the skin belongs to the poxvirus family. Chickenpox remains a common childhood 
infection. A poxvirus also causes molluscum contagiosum, which is an infection of the 
1 0 skin by a poxvirus that causes skin-colored, smooth, waxy bumps. 

Sunlight can cause severe skin damage. Sunburn results from an overexposure to 
ultraviolet B (UVB) rays. Some sunburned people develop a fever, chills, and weakness, 
and those with very bad sunburns even may go into shock-low blood pressure, and 
15 fainting. 

People who are in the sun a lot have an increased risk of skin cancers, including 
squamous cell carcinoma, basal cell carcinoma, and to some degree, malignant 
melanoma. 

20 

Drugs, among other causes, can cause skin photosensitivity reactions which can occur 
after only a few minutes of sun exposure. These reactions include redness, peeling, 
hives, blisters, and thickened, scaling patches (photosensitivity). 

25 Some skin disorders are characterized as Pigment Disorders. 

Albinism is a rare, inherited disorder in which no melanin is formed. 

Vitiligo is a condition in which a loss of melanocytes results in smooth, whitish patches 
30 of skin, which may occur after unusual physical trauma and tends to occur with certain 
other diseases, including Addison's disease, diabetes, pernicious anemia, and thyroid 
disease. 
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Tinea versicolor is a fungal infection of the skin that sometimes results in 
hyperpigmentation. 

5 Melasma appears on the face (usually the forehead, cheeks, temples, and jaws) as a 
roughly symmetric group of dark brown patches of pigmentation that are often clearly 
delineated. 

Skin growths, which are abnormal accumulations of different types of cells, may be 
10 present at birth or develop later. Noncancerous (benign) growth and cancerous 
(malignant) growth types are distinguished. 

Moles (nevi) are small, usually dark, skin growths that develop from pigment-producing 
cells in the skin (melanocytes). Most moles are harmless. However, noncancerous moles 
1 5 can develop into malignant melanoma. 

Skin tags are soft, small, flesh-colored or slightly darker skin flaps that appear mostly on 
the neck, in the armpits, or in the groin. 

20 Lipomas are soft deposits of fatty material that grow under the skin, causing round or 
oval lumps. 

Angiomas are collections of abnormally dense blood or lymph vessels that are usually 
located in and below the skin and that cause red or purple discolorations. 

25 

Examples of angiomas are port-wine stains, strawberry marks, cavernous hemangiomas, 
spider angiomas, and lymphangiomas. 

Pyogenic granulomas are scarlet, brown, or blue-black slightly raised areas caused by 
30 increased growth of capillaries (the smallest blood vessels) and swelling of the 
surrounding tissue. 
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Seborrheic keratoses (sometimes called seborrheic warts) are flesh-colored, brown, or 
black growths that can appear anywhere on the skin. 

Dermatofibromas are small, red-to-brown bumps (nodules) that result from an 
5 accumulation of fibroblasts, the cells that populate the soft tissue under the skin. 

Keratoacanthomas are round, firm, usually flesh-colored growths that have an unusual 
central crater containing a pasty material. 

10 Keloids are smooth, shiny, slightly pink, often dome-shaped, proliferative growths of 
fibrous tissue that form over areas of injury or over surgical wounds. 

Skin cancer is the most common form of cancer, but most types of skin cancers are 
curable. 

15 

Basal cell carcinoma is a cancer that originates in the lowest layer of the epidermis. 

Squamous cell carcinoma is cancer that originates in the middle layer of the epidermis. 

20 Bowen's disease is a form of squamous cell carcinoma that's confined to the epidermis 
and hasn't yet invaded the underlying dermis. 

Melanoma is a cancer that originates in the pigment-producing cells of the skin 
(melanocytes). 

25 

Kaposi's sarcoma is a cancer that originates in the blood vessels, usually of the skin. 

Paget's disease is a rare type of skin cancer that looks like an inflamed, reddened patch 
of skin (dermatitis); it originates in glands in or under the skin. 

30 

The human DPP4 is highly expressed in the following dermatological tissues: skin. The 
expression in the above mentioned tissues demonstrates that the human DPP4 or mRNA 
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can be utilized to diagnose of dermatological diseases. Additionally the activity of the 
human DPP4 can be modulated to treat those diseases. 

Cancer Disorders 

5 

Cancer disorders within the scope of this definition comprise any disease of an organ or 
tissue in mammals characterized by poorly controlled or uncontrolled multiplication of 
normal or abnormal cells in that tissue and its effect on the body as a whole. Cancer 
diseases within the scope of the definition comprise benign neoplasms, dysplasias, 

10 hyperplasias as well as neoplasms showing metastatic growth or any other 
transformations like e.g. leukoplakias which often precede a breakout of cancer. Cells 
and tissues are cancerous when they grow more rapidly than normal cells, displacing or 
spreading into the surrounding healthy tissue or any other tissues of the body described 
as metastatic growth, assume abnormal shapes and sizes, show changes in their 

15 nucleocytoplasmatic ratio, nuclear polychromasia, and finally may cease. Cancerous 
cells and tissues may affect the body as a whole when causing paraneoplastic syndromes 
or if cancer occurs within a vital organ or tissue, normal function will be impaired or 
halted, with possible fatal results. The ultimate involvement of a vital organ by cancer, 
either primary or metastatic, may lead to the death of the mammal affected. Cancer 

20 tends to spread, and the extent of its spread is usually related to an individual's chances 
of surviving the disease. Cancers are generally said to be in one of three stages of 
growth: early, or localized, when a tumor is still confined to the tissue of origin, or 
primary site; direct extension, where cancer cells from the tumour have invaded adjacent 
tissue or have spread only to regional lymph nodes; or metastasis, in which cancer cells 

25 have migrated to distant parts of the body from the primary site, via the blood or lymph 
systems, and have established secondary sites of infection. Cancer is said to be 
malignant because of its tendency to cause death if not treated. Benign tumors usually do 
not cause death, although they may if they interfere with a normal body function by 
virtue of their location, size, or paraneoplastic side effects. Hence benign tumors fall 

30 under the definition of cancer within the scope of this definition as well. In general, 
cancer cells divide at a higher rate than do normal cells, but the distinction between the 
growth of cancerous and normal tissues is not so much the rapidity of cell division in the 
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former as it is the partial or complete loss of growth restraint in cancer cells and their 
failure to differentiate into a useful, limited tissue of the type that characterizes the 
functional equilibrium of growth of normal tissue. Cancer tissues may express certain 
molecular receptors and probably are influenced by the host's susceptibility and 

5 immunity and it is known that certain cancers of the breast and prostate, for example, are 
considered dependent on specific hormones for their existence. The term "cancer" under 
the scope of the definition is not limited to simple benign neoplasia but comprises any 
other benign and malign neoplasia like 1) Carcinoma, 2) Sarcoma, 3) Carcinosarcoma, 
4) Cancers of the blood-forming tissues, 5) tumors of nerve tissues including the brain, 

10 6) cancer of skin cells. Cancer according to 1) occurs in epithelial tissues, which cover 
the outer body (the skin) and line mucous membranes and the inner cavitary structures 
of organs e.g. such as the breast, lung, the respiratory and gastrointestinal tracts, the 
endocrine glands, and the genitourinary system. Ductal or glandular elements may 
persist in epithelial tumors, as in adenocarcinomas like e.g. thyroid adenocarcinoma, 

15 gastric adenocarcinoma, uterine adenocarcinoma. Cancers of the pavement-cell 
epithelium of the skin and of certain mucous membranes, such as e.g. cancers of the 
tongue, lip, larynx, urinary bladder, uterine cervix, or penis, may be termed epidermoid 
or squamous-cell carcinomas of the respective tissues and are in the scope of the 
definition of cancer as well. Cancer according to 2) develops in connective tissues, 

20 including fibrous tissues, adipose (fat) tissues, muscle, blood vessels, bone, and cartilage 
like e.g. osteogenic sarcoma; liposarcoma, fibrosarcoma, synovial sarcoma. Cancer 
according to 3) is cancer that develops in both epithelial and connective tissue. Cancer 
disease within the scope of this definition may be primary or secondary, whereby 
primary indicates that the cancer originated in the tissue where it is found rather than 

25 was established as a secondary site through metastasis from another lesion. Cancers and 
tumor diseases within the scope of this definition may be benign or malign and may 
affect all anatomical structures of the body of a mammal. By example but not limited to 
they comprise cancers and tumor diseases of I) the bone marrow and bone marrow 
derived cells (leukemias), II) the endocrine and exocrine glands like e.g. thyroid, 

30 parathyroid, pituitary, adrenal glands, salivary glands, pancreas HI) the breast, like e.g. 
benign or malignant tumors in the mammary glands of either a male or a female, the 
mammary ducts, adenocarcinoma, medullary carcinoma, comedo carcinoma, Paget's 
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disease of the nipple, inflammatory carcinoma of the young woman, IV) the lung, V) the 
stomach, VI) the liver and spleen, VII) the small intestine, VIII) the colon, DC) the bone 
and its supportive and connective tissues like malignant or benign bone tumour, e.g. 
malignant osteogenic sarcoma, benign osteoma, cartilage tumors; like malignant 

5 chondrosarcoma or benign chondroma; bone marrow tumors like malignant myeloma or 
benign eosinophilic granuloma, as well as metastatic tumors from bone tissues at other 
locations of the body, X) the mouth, throat, larynx, and the esophagus, XI) the urinary 
bladder and the internal and external organs and structures of the urogenital system of 
male and female like ovaries, uterus, cervix of the uterus, testes, and prostate gland, XII) 

10 the prostate, XIII) the pancreas, like ductal carcinoma of the pancreas; XIV) the 
lymphatic tissue like lymphomas and other tumors of lymphoid origin, XV) the skin, 
XVI) cancers and tumor diseases of all anatomical structures belonging to the respiration 
and respiratory systems including thoracal muscles and linings, XVII) primary or 
secondary cancer of the lymph nodes XVIII) the tongue and of the bony structures of the 

15 hard palate or sinuses, XVTV) the mouth, cheeks, neck and salivary glands, XX) the 
blood vessels including the heart and their linings, XXI) the smooth or skeletal muscles 
and their ligaments and linings, XXII) the peripheral, the autonomous, the central 
nervous system including the cerebellum, XXIII) the adipose tissue. 

20 The human DPP4 is highly expressed in the following cancer tissues: stomach tumor, 
liver tumor, HEP G2 cells, uterus tumor, prostata, kidney tumor. The expression in the 
above mentioned tissues and in particular the differential expression between diseased 
tissue stomach tumor and healthy tissue stomach, between diseased tissue liver tumor 
and healthy tissue liver, between diseased tissue HEP G2 cells and healthy tissue liver, 

25 between diseased tissue uterus tumor and healthy tissue uterus, between diseased tissue 
kidney tumor and healthy tissue kidney demonstrates that the human DPP4 or mRNA 
can be utilized to diagnose of cancer. Additionally the activity of the human DPP4 can 
be modulated to treat cancer. 
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Disorders Related to Pulmology 

Asthma is thought to arise as a result of interactions between multiple genetic and 
environmental factors and is characterized by three major features: 1) intermittent and 

5 reversible airway obstruction caused by bronchoconstriction, increased mucus 
production, and thickening of the walls of the airways that leads to a narrowing of the 
airways, 2) airway hyperresponsiveness, and 3) airway inflammation. Certain cells are 
critical to the inflammatory reaction of asthma and they include T cells and antigen 
presenting cells, B cells that produce IgE, and mast cells, basophils, eosinophils, and 

10 other cells that bind IgE. These effector cells accumulate at the site of allergic reaction 
in the airways and release toxic products that contribute to the acute pathology and 
eventually to tissue destruction related to the disorder. Other resident cells, such as 
smooth muscle cells, lung epithelial cells, mucus-producing cells, and nerve cells may 
also be abnormal in individuals with asthma and may contribute to its pathology. While 

15 the airway obstruction of asthma, presenting clinically as an intermittent wheeze and 
shortness of breath, is generally the most pressing symptom of the disease requiring 
immediate treatment, the inflammation and tissue destruction associated with the disease 
can lead to irreversible changes that eventually make asthma a chronic and disabling 
disorder requiring long-term management. 

20 

Chronic obstructive pulmonary (or airways) disease (COPD) is a condition defined 
physiologically as airflow obstruction that generally results from a mixture of 
emphysema and peripheral airway obstruction due to chronic bronchitis [Botstein, 
1980]. Emphysema is characterised by destruction of alveolar walls leading to abnormal 
25 enlargement of the air spaces of the lung. Chronic bronchitis is defined clinically as the 
presence of chronic productive cough for three months in each of two successive years. 
In COPD, airflow obstruction is usually progressive and is only partially reversible. By 
far the most important risk factor for development of COPD is cigarette smoking, 
although the disease does also occur in non-smokers. 

30 

The human DPP4 is highly expressed in the following tissues of the respiratory system: 
leukocytes (peripheral blood), fetal lung, fetal lung fibroblast IMR-90 cells, fetal lung 
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fibroblast MRC-5 cells, lung lupus disease. The expression in the above mentioned 
tissues and in particular the differential expression between diseased tissue fetal lung 
fibroblast IMR-90 cells and healthy tissue fetal lung, between diseased tissue fetal lung 
fibroblast MRC-5 cells and healthy tissue fetal lung, between diseased tissue lung lupus 
5 disease and healthy tissue lung demonstrates that the h uman D PP4 o r m RNA c an b e 
utilized to diagnose of respiratory diseases. Additionally the activity of the human DPP4 
can be modulated to treat those diseases. 

Disorders Related to Urology 

10 

Genitourinary disorders comprise benign and malign disorders of the organs constituting 
the genitourinary system of female and male, renal diseases like acute or chronic renal 
failure, immunologically mediated renal diseases like renal transplant rejection, lupus 
nephritis, immune complex renal diseases, glomerulopathies, nephritis, toxic 
15 nephropathy, obstructive uropathies like benign prostatic hyperplasia (BPH), neurogenic 
bladder syndrome, urinary incontinence like urge-, stress-, or overflow incontinence, 
pelvic pain, and erectile dysfunction. 

The human DPP4 is highly expressed in the following urological tissues: prostata, 
20 protate BPH, fetal kidney, kidney, kidney tumor, HEK 293 cells. The expression in the 
above mentioned tissues and in particular the differential expression between diseased 
tissue kidney tumor and healthy tissue kidney, the differential expression between 
diseased tissue prostate BPH and healthy tissue prostate demonstrates that the human 
DPP4 or mRNA can be utilized to diagnose of urological disorders. Additionally the 
25 activity of the human DPP4 can be modulated to treat urological disorders. 

Metabolic Disorders 

Metabolic diseases are defined as conditions which result from an abnormality in any of 
30 the chemical or biochemical transformations and their regulating systems essential to 
producing energy, to regenerating cellular constituents, to eliminating unneeded 
products arising from these processes, and to regulate and maintain homeostasis in a 
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mammal regardless of whether acquired or the result of a genetic transformation. 
Depending on which hietabolic pathway is involved, a single defective transformation or 
disturbance of its jegulation may produce consequences that are narrow, involving a 
single body function, or broad, affecting many organs, organ-systems or the body as a 
5 whole. Diseases resulting from abnormalities related to the fine and coarse mechanisms 
that affect each individual transformation, its rate and direction or the availability of 
substrates like amino acids, fatty acids, carbohydrates, minerals, cofactors, hormones, 
regardless whether, they are inborn or acquired, are well within the scope of the 
definition of a metabolic disease according to this application. 

10 

Metabolic diseases often are caused by single defects in particular biochemical 
pathways, defects that are due to the deficient activity of individual enzymes or 
molecular receptors leading to the regulation of such enzymes. Hence in a broader sense 
disturbances of the underlying genes, their products and their regulation lie well within 
15 the scope of this definition of a metabolic disease. For example, but not limited to, 
metabolic diseases may affect 1) biochemical processes and tissues ubiquitous all over 
the body, 2) the bone, 3) the nervous system, 4) the endocrine system, 5) the muscle 
including the heart, 6) the skin and nervous tissue, 7) the urogenital system, 8) the 
homeostasis of body systems like water and electrolytes. For example, but not limited to, 
20 metabolic diseases according to 1) comprise obesity, amyloidosis, disturbances of the 
amino acid metabolism like branched chain disease, hyperaminoacidemia, 
hyperaminoaciduria, disturbances of the metabolism of urea, hyperammonemia, 
mucopolysaccharidoses e.g. Maroteaux-Lamy syndrom, storage diseases like glycogen 
storage diseases and lipid storage diseases, glycogenosis diseases like Con's disease, 
25 malabsorption diseases like intestinal carbohydrate malabsorption, oligosaccharidase 
deficiency like maltase-, lactase-, sucrase-insufficiency, disorders of the metabolism of 
fructose, disorders of the metabolism of galactose, galactosaemia, disturbances of 
carbohydrate utilization like diabetes, hypoglycemia, disturbances of pyruvate 
metabolism, hypolipidemic hypolipoproteinemia, hyperlipidemia, 

30 hyperlipoproteinemia, carnitine or carnitine acyltransferase deficiency, disturbances of 
the porphyrin metabolism, porphyrias, disturbances of the purine metabolism, lysosomal 
diseases, metabolic diseases of nerves and nervous systems like gangliosidoses, 
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sphingolipidoses, sulfatidoses, leucodystrophies, Lesch-Nyhan syndrome. For example, 
but not limited to, metabolic diseases according to 2) comprise osteoporosis, 
osteomalacia like osteoporosis, osteopenia, osteogenesis imperfecta, osteopetrosis, 
osteonecrosis, Paget's disease of bone, hypophosphatemia. For example, but not limited 
5 to, metabolic diseases according to 3) comprise cerebellar dysfunction, disturbances of 
brain metabolism like dementia, Alzheimer's disease, Huntington's chorea, Parkinson's 
disease, Pick's disease, toxic encephalopathy, demyelinating neuropathies like 
inflammatory neuropathy, Guillain-BairS syndrome. For example, but not limited to, ' 
metabolic diseases according to 4) comprise primary and secondary metabolic disorders 
10 associated with hormonal defects like any disorder stemming from either an 
hyperfiinction or hypofunction of some hormone-secreting endocrine gland and any 
combination thereof. They comprise Sipple's syndrome, pituitary gland dysfunction and 
its effects on other endocrine glands, such as the thyroid, adrenals, ovaries, and testes, 
acromegaly, hyper- and hypothyroidism, euthyroid goiter, euthyroid sick syndrome, 
15 thyroiditis, and thyroid cancer, over- or underproduction of the adrenal steroid 
hormones, adrenogenital syndrome, Cushing's syndrome, Addison's disease of the 
adrenal cortex, Addison's pernicious anemia, primary and secondary aldosteronism, 
diabetes insipidus, carcinoid syndrome, disturbances caused by the dysfunction of the 
parathyroid glands, pancreatic islet cell dysfunction, diabetes, disturbances of the 
20 endocrine system of the female like estrogen deficiency, resistant ovary syndrome. For 
example, but not limited to, metabolic diseases according to 5) comprise muscle 
weakness, myotonia, Duchenne's and other muscular dystrophies, dystrophia myotonica 
of Steinert, mitochondrial myopathies like distuibances of the catabolic metabolism in 
the muscle, carbohydrate and lipid storage myopathies, glycogenoses, myoglobinuria, 
25 malignant hyperthermia, polymyalgia rheumatica, dermatomyositis, primary myocardial 
disease, cardiomyopathy. For example, but not limited to, metabolic diseases according 
to 6) comprise disorders of the ectoderm, neurofibromatosis, scleroderma and 
polyarteritis, Louis-Bar syndrome, von Hippel-Lindau disease, Sturge-Weber syndrome, 
tuberous sclerosis, amyloidosis, porphyria. For example, but not limited to, metabolic 
30 diseases according to 7) comprise sexual dysfunction of the male and female. For 
example, but not limited to, metabolic diseases according to 8) comprise confused states 
and seizures due to inappropriate secretion of antidiuretic hormone from the pituitary 
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gland, Liddle's syndrome, Barter's syndrome, Fanconi's syndrome, renal electrolyte 
wasting, diabetes insipidus. 

The human DPP4 is highly expressed in the following metabolic disease related tissues: 
5 adipose and ens. The expression in the above mentioned tissues demonstrates that the 
human DPP4 or mRNA can be utilized to diagnose of metabolic diseases. Additionally 
the activity of the human DPP4 can be modulated to treat metabolic diseases. 

Applications 

10 

The present invention provides for both prophylactic and therapeutic methods for 
cardiovascular disorders, dermatological disorders, cancer disorders, hematological 
disorders, respiratory diseases, gastrointestinal and liver diseases, urological disorders 
and metabolic diseases. 

15 

The regulatory method of the invention involves contacting a cell with an agent that 
modulates one or more of the activities of DPP4. An agent that modulates activity can 
be an agent as described herein, such as a nucleic acid or a protein, a naturally-occurring 
cognate ligand of the polypeptide, a peptide, a peptidomimetic, or any small molecule. 

20 In one embodiment, the agent stimulates one or more of the biological activities of 
DPP4. Examples of such stimulatory agents include the active DPP4 and nucleic acid 
molecules encoding a portion of DPP4. In another embodiment, the agent inhibits one 
or more of the biological activities of DPP4. Examples of such inhibitory agents include 
antisense nucleic acid molecules and antibodies. These regulatoiy methods can be 

25 performed in vitro (e.g., by culturing the cell with the agent) or, alternatively, in vivo 
(e.g, by administering the agent to a subject). As such, the present invention provides 
methods of treating an individual afflicted with a disease or disorder characterized by 
unwanted expression or activity of DPP4 or a protein in the DPP4 signaling pathway. In 
one embodiment, the method involves administering an agent like any agent identified 

30 or being identifiable by a screening assay as described herein, or combination of such 
agents that modulate say upregulate or downregulate the expression or activity of DPP4 
or of any protein in the DPP4 signaling pathway. In another embodiment, the method 



WO 2004/104216 



PCT/EP2004/005071 



.74. 

involves administering a regulator of DPP4 as therapy to compensate for reduced or 
undesirably low expression or activity of DPP4 or a protein in the DPP4 signaling 
pathway. 

5 Stimulation of activity or expression of DPP4 is desirable in situations in which 
enzymatic activity or expression is abnormally low and in which increased activity is 
likely to have a beneficial effect. Conversely, inhibition of enzymatic activity or 
expression of DPP4 is desirable in situations in which activity or expression of DPP4 is 
abnormally high and in which decreasing its activity is likely to have a beneficial effect. 

10 

This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 

15 Pharmaceutical Compositions 

This invention further pertains to novel agents identified by the above-described 
screening assays and uses thereof for treatments as described herein. 

20 The nucleic acid molecules, polypeptides, and antibodies (also referred to herein as 
"active compounds") of the invention can be incorporated into pharmaceutical 
compositions suitable for administration. Such compositions typically comprise the 
nucleic acid molecule, protein, or antibody and a pharmaceutically acceptable carrier. 
As used herein the language "pharmaceutically acceptable carrier" is intended to include 

25 any and all solvents, dispersion media, coatings, antibacterial and antifungal agents, 
isotonic and absorption delaying agents, and the like, compatible with pharmaceutical 
administration. The use of such media and agents for pharmaceutically active 
substances is well known in the art. Except insofar as any conventional media or agent 
is incompatible with the active compound, use thereof in the compositions is 

30 contemplated. Supplementary active compounds can also be incorporated into the 
compositions. 
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The invention includes pharmaceutical compositions comprising a regulator of DPP4 
expression or activity (and/or a regulator of the activity or expression of a protein in the 
DPP4 signaling pathway) as well as methods for preparing such compositions by 
combining one or more such regulators and a pharmaceutical^ acceptable carrier. Also 
5 within the invention are pharmaceutical compositions comprising a regulator identified 
using the screening assays of the invention packaged with instructions for use. For 
regulators that are antagonists of DPP4 activity or which reduce DPP4 expression, the 
instructions would specify use of the pharmaceutical composition for treatment of 
hematological and cardiovascular diseases, disorders of the peripheral and central 
10 nervous system, COPD, asthma, genito-urological disorders and inflammation diseases. 
For regulators that are agonists of DPP4 activity or increase DPP4 expression, the 
instructions would specify use of the pharmaceutical composition for treatment of 
hematological and cardiovascular diseases, disorders of the peripheral and central 
nervous system, COPD, asthma, genito-urological disorders and inflammation diseases. 

15 

An inhibitor of DPP4 may be produced using methods which are generally known in the 
art. In particular, purified DPP4 may be used to produce antibodies or to screen libraries 
of pharmaceutical agents to identify those which specifically bind DPP4. Antibodies to 
DPP4 may also be generated using methods that are well known in the art. Such 
20 antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, single 
chain antibodies, Fab fragments, and fragments produced by a Fab expression library. 
Neutralizing antibodies like those which inhibit dimer formation are especially preferred 
for therapeutic use. 

25 In another embodiment of the invention, the polynucleotides encoding DPP4, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 
the complement of the polynucleotide encoding DPP4 may be used in situations in 
which it would be desirable to block the transcription of the mRNA. In particular, cells 
may be transformed with sequences complementary to polynucleotides encoding DPP4. 

30 Thus, complementary molecules or fragments may be used to modulate DPP4 activity, 
or to achieve regulation of gene function. Such technology is now well known in the art, 
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and sense or antisense oligonucleotides or larger fragments can be designed from various 
locations along the coding or control regions of sequences encoding DPP4. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia 
5 viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue, or cell population. Methods which are well 
known to those skilled in the art can be used to construct vectors which will express 
nucleic acid sequence complementary to the polynucleotides of the gene encoding 
DPP4. These techniques are described, for example, in [Scott and Smith (1990)]. 

10 

Any of the therapeutic methods described above may be applied to any subject in need 
of such therapy, including, for example, mammals such as dogs, cats, cows, horses, 
rabbits, monkeys, and most preferably, humans. 

15 An additional embodiment of the invention relates to the administration of a 
pharmaceutical composition containing DPP4 in conjunction with a pharmaceutical^ 
acceptable carrier, for any of the therapeutic effects discussed above. Such 
pharmaceutical compositions may consist of DPP4, antibodies to DPP4, and mimetics, 
agonists, antagonists, or inhibitors of DPP4. The compositions may be administered 

20 alone or in combination with at least one other agent, such as a stabilizing compound, 
which may be administered in any sterile, biocompatible pharmaceutical carrier 
including, but not limited to, saline, buffered saline, dextrose, and water. The 
compositions may be administered to a patient alone, or in combination with other 
agents, drugs or hormones. 

25 

A pharmaceutical composition of the invention is formulated to be compatible with its 
intended route of administration. Examples of routes of administration include 
parenteral, e.g., intravenous, intradermal, subcutaneous, oral (e.g., inhalation), 
transdermal (topical), transmucosal, and rectal administration. Solutions or suspensions 
30 used for parenteral, intradermal, or subcutaneous application can include the following 
components: a sterile diluent such as water for injection, saline solution, fixed oils, 
polyethylene glycols, glycerine, propylene glycol or other synthetic solvents; 



WO 2004/104216 



PCT/EP2004/005071 



-77- 

antibacterial agents such as benzyl alcohol or methyl parabens; antioxidants such as 
ascorbic acid or sodium bisulfite; chelating agents such as ethylenediaminetetraacetic 
acid; buffers such as acetates, citrates or phosphates and agents for the adjustment of 
tonicity such as sodium chloride or dextrose. pH can be adjusted with acids or bases, 
5 such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or 
plastic. 

Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 

10 (where water soluble) or dispersions and sterile powders for the extemporaneous 
preparation of sterile injectable solutions or dispersions. For intravenous administration, 
suitable carriers include physiological saline, bacteriostatic water, Cremophor EM™ 
(BASF, Parsippany, NJ.) or phosphate buffered saline (PBS). In all cases, the 
composition must be sterile and should be fluid to the extent that easy syringability 

15 exists. It must be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and 
fungi. The carrier can be a solvent or dispersion medium containing, for example, 
water, ethanol, a pharmaceutical^ acceptable polyol like glycerol, propylene glycol, 
liquid polyetheylene glycol, and suitable mixtures thereof. The proper fluidity can be 

20 maintained, for example, by the use of a coating such as lecithin, by the maintenance of 
the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial and 
antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic agents, 

25 for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in the 
composition. Prolonged absorption of the injectable compositions can be brought about 
by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. Sterile injectable solutions can be prepared by 
incorporating the active compound (e.g., a polypeptide or antibody) in the required 

30 amount in an appropriate solvent with one or a combination of ingredients enumerated 
above, as required, followed by filtered sterilization. Generally, dispersions are prepared 
by incorporating the active compound into a sterile vehicle which contains a basic 
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dispersion medium and the required other ingredients from those enumerated above. In 
the case of sterile powders for the preparation of sterile injectable solutions, the 
preferred methods of preparation are vacuum drying and freeze-drying which yields a 
powder of the active ingredient plus any additional desired ingredient from a previously 
5 sterile-filtered solution thereof. 

Oral compositions generally include an inert diluent or an edible carrier. They can be 
enclosed in gelatin capsules or compressed into tablets. For the purpose of oral 
therapeutic administration, the active compound can be incorporated with excipients and 
10 used in the form of tablets, troches, or capsules. Oral compositions can also be prepared 
using a fluid carrier for use as a mouthwash, wherein the compound in the fluid carrier is 
applied orally and swished and expectorated or swallowed. 

Pharmaceutically compatible binding agents, and/or adjuvant materials can be included 
15 as part of the composition. The tablets, pills, capsules, troches and the like can contain 
any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or 
lactose, a disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant 
such as magnesium stearate or sterotes; a glidant such as colloidal silicon dioxide; a 
20 sweetening agent such as sucrose or saccharin; or a flavoring agent such as peppermint, 
methyl salicylate, or orange flavoring. 

For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from a pressurized container or dispenser which contains a suitable propellant, 
25 e.g., a gas such as carbon dioxide, or a nebulizer. 

Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be 
permeated are used in the formulation. Such penetrants are generally known in the art, 
30 and include, for example, for transmucosal administration, detergents, bile salts, and 
fusidic acid derivatives. Transmucosal administration can be accomplished through the 
use of nasal sprays or suppositories. For transdermal administration, the active 
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compounds are formulated into ointments, salves, gels, or creams as generally known in 
the art. 

The compounds can also be prepared in the form of suppositories (e.g., with 
5 conventional suppository bases such as cocoa butter and other glycerides) or retention 
enemas for rectal delivery. 

In one embodiment, the active compounds are prepared with carriers that will protect the 
compound against rapid elimination from the body, such as a controlled release 

10 formulation, including implants and microencapsulated delivery systems. 
Biodegradable, biocompatible polymers can be used, such as ethylene vinyl acetate, 
polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and polylactic acid. 
Methods for preparation of such formulations will be apparent to those skilled in the art. 
The materials can also be obtained commercially from Alza Corporation and Nova 

15 . Pharmaceuticals, Inc. Liposomal suspensions (including liposomes targeted to infected 
cells with monoclonal antibodies to viral antigens) can also be used as pharmaceutical^ 
acceptable carriers. These can be prepared according to methods known to those skilled 
in the art, for example, as described in U.S. 4,522,81 1. 

20 It is especially advantageous to formulate oral or parenteral compositions in dosage unit 
form f or e ase o f a dministration a nd u niformity o f d osage. D osage u nit form as used 
herein refers to physically discrete units suited as unitary dosages for the subject to be 
treated; each unit containing a predetermined quantity of active compound calculated to 
produce the desired therapeutic effect in association with the required pharmaceutical 

25 carrier. The specification for the dosage unit forms of the invention are dictated by and 
directly dependent on the unique characteristics of the active compound and the 
particular therapeutic effect to be achieved, and the limitations inherent in the art of 
compounding such an active compound for the treatment of individuals. 

30 The pharmaceutical compositions can be included in a container, pack, or dispenser 
together with instructions for administration. For pharmaceutical compositions which 
include an antagonist of DPP4 activity, a compound which reduces expression of DPP4, 
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or a compound which reduces expression or activity of a protein in the DPP4 signaling 
pathway or any combination thereof, the instructions for administration will specify use 
of the composition for hematological and cardiovascular diseases, disorders of the 
peripheral and central nervous system, COPD, asthma, genito-urological disorders and 

5 inflammation diseases. For pharmaceutical compositions which include an agonist of 
DPP4 activity, a compound which increases expression of DPP4, or a compound which 
increases expression or activity of a protein in the DPP4 signaling pathway or any 
combination thereof; the instructions for administration will specify use of the 
composition for cardiovascular disorders, dermatological disorders, cancer disorders, 

10 hematological disorders, respiratory diseases, gastrointestinal and liver diseases, 
urological disorders and metabolic diseases. 

Diagnostics 

15 In another embodiment, antibodies which specifically bind DPP4 may be used for the 
diagnosis of disorders characterized by the expression of DPP4, or in assays to monitor 
patients being treated with DPP4 or agonists, antagonists, and inhibitors of DPP4. 
Antibodies useful for diagnostic purposes may be prepared in the same manner as those 
described above for therapeutics. Diagnostic assays for DPP4 include methods which 

20 utilize the antibody and a label to detect DPP4 in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be 
labeled by covalent or non-covalent joining with a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may 
be used. 

25 

A variety of protocols for m easuring DPP4, including ELISAs, RIAs, and F ACS, are 
known in the art and provide a basis for diagnosing altered or abnormal levels of DPP4 
expression. Normal or standard values for DPP4 expression are established by 
combining body fluids or cell extracts taken from normal mammalian subjects, 
30 preferably human, with antibody to DPP4 under conditions suitable for complex 
formation. The amount of standard complex formation may be quantified by various 
methods, preferably by photometric means. Quantities of DPP4 expressed in subject 
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samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding DPP4 may be 
5 used for diagnostic purposes. The polynucleotides which may be used include oligo- 
nucleotide sequences, complementary RNA and DNA molecules, and PNAs. The poly- 
nucleotides may be used to detect and quantitate gene expression in biopsied tissues in 
which expression of DPP4 may be correlated with disease. The diagnostic assay may be 
used to distinguish between absence, presence, and excess expression of DPP4, and to 
1 0 monitor regulation of DPP4 levels during therapeutic intervention. 

Polynucleotide sequences encoding DPP4 may be used for the diagnosis of cardio- 
vascular disorders, dermatological disorders, cancer disorders, hematological disorders, 
respiratory diseases, gastrointestinal and liver diseases, urological disorders and 
IS metabolic diseases associated with expression of DPP4. The polynucleotide sequences 
encoding DPP4 may be used in Southern, Northern, or dot-blot analysis, or other 
membrane-based technologies; in PCR technologies; in dipstick, pin, and ELISA assays; 
and in microarrays utilizing fluids or tissues from patient biopsies to detect altered DPP4 
expression. Such qualitative or quantitative methods are well known in the art. 

20 

In a particular aspect, the nucleotide sequences encoding DPP4 may be useful in assays 
that detect the presence of associated disorders, particularly those mentioned above. 
The nucleotide sequences encoding DPP4 may be labelled by standard methods and 
added to a fluid or tissue sample from a patient under conditions suitable for the 

25 formation of hybridization complexes. After a suitable incubation period, the sample is 
washed and the signal is quantitated and compared with a standard value. If the amount 
of signal in the patient sample is significantly altered from that of a comparable control 
sample, the nucleotide sequences have hybridized with nucleotide sequences in the 
sample, and the presence of altered levels of nucleotide sequences encoding DPP4 in the 

30 sample indicates the presence of the associated disorder. Such assays may also be used 
to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, 
in clinical trials, or in monitoring the treatment of an individual patient. 
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In order to provide a basis for the diagnosis of cardiovascular disorders, dermatological 
disorders, cancer disorders, hematological disorders, respiratory diseases, 
gastrointestinal and liver diseases, urological disorders and metabolic diseases 

5 associated with expression of DPP4, a normal or standard profile for expression is 
established. This may be accomplished by combining body fluids or cell extracts taken 
from normal subjects, either animal or human, with a sequence, or a fragment thereof, 
encoding DPP4, under conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects 

10 with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may be 
compared with values obtained from samples from patients who are symptomatic for a 
disorder. Deviation from standard values is used to establish the presence of a disorder. 

15 Determination of a Therapeutically Effective Dose 

The determination of a therapeutically effective dose is well within the capability of 
those skilled in the art. A therapeutically effective dose refers to that amount of active 
ingredient which increases or decreases DPP4 activity relative to DPP4 activity which 

20 occurs in the absence of the therapeutically effective dose. For any compound, the 
therapeutically effective dose can be estimated initially either in cell culture assays or in 
animal models, usually mice, rabbits, dogs, or pigs. The animal model also can be used 
to determine the appropriate concentration range and route of administration. Such 
information can then be used to determine useful doses and routes for administration in 

25 humans. 

Therapeutic efficacy and toxicity, e.g., ED 5 o (the dose therapeutically effective in 50% 
of the population) and LD50 (the dose lethal to 50% of the population), can be 
determined by standard pharmaceutical procedures in cell cultures or experimental 
30 animals. The dose ratio of toxic to therapeutic effects is the therapeutic index, and it can 
be expressed as the ratio, LD50/ED50. Pharmaceutical compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal 
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studies is used in formulating a range of dosage for human use. The dosage contained in 
such compositions is preferably within a range of circulating concentrations that include 
the ED 5 o with little or no toxicity. The dosage varies within this range depending upon 
the dosage form employed, sensitivity of the patient, and the route of administration. 
5 The exact dosage will be determined by the practitioner, in light of factors related to the 
subject that requires treatment. Dosage and administration are adjusted to provide 
sufficient levels of the active ingredient or to maintain the desired effect. Factors which 
can be taken into account include the severity of the disease state, general health of the 
subject, age, weight, and gender of the subject, diet, time and frequency of 
10 administration, drug combination(s), reaction sensitivities, and tolerance/response to 
therapy. Long-acting pharmaceutical compositions can be administered every 3 to 4 
days, every week, or once every two weeks depending on the half-life and clearance rate 
of the particular formulation. 

15 Normal dosage amounts can vary from 0.1 micrograms to 100,000 micrograms, up to a 
total dose of about 1 g, depending upon the route of administration. Guidance as to 
particular d osages a nd m ethods o f d eli very i s p rovided i n t he literature and generally 
available to practitioners in the art. Those skilled in the art will employ different 
formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of 

20 polynucleotides or polypeptides will be specific to particular cells, conditions, locations, 
etc. If the reagent is a single-chain antibody, polynucleotides encoding the antibody can 
be constructed and introduced into a cell either ex vivo or in vivo using well-established 
techniques including, but not limited to, transferrin-polycation-mediated DNA transfer, 
transfection with naked or encapsulated nucleic acids, liposome-ihediated cellular 

25 fusion, intracellular transportation of DNA-coated latex beads, protoplast fusion, viral 
infection, electroporation, "gene gun", and DEAE- or calcium phosphate-mediated 
transfection. 

If the expression product is mRNA, the reagent is preferably an antisense oligo- 
30 nucleotide or a ribozyme. Polynucleotides which express antisense oligonucleotides or 
ribozymes can be introduced into cells by a variety of methods, as described above. 
Preferably, a reagent reduces expression of DPP4 gene or the activity of DPP4 by at least 
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about 10, preferably about 5 0, more preferably about 7 5, 90, or 1 00% relative to the 
absence of the reagent. The effectiveness of the mechanism chosen to decrease the level 
of expression of DPP4 gene or the activity of DPP4 can be assessed using methods well 
known in the art, such as hybridization of nucleotide probes to DPP4-specific mRNA, 
5 quantitative RT-PCR, immunologic detection of DPP4, or measurement of DPP4 
activity. 

In any of the embodiments described above, any of the pharmaceutical compositions of 
the invention can be administered in combination with other appropriate therapeutic 

10 agents. Selection of the appropriate agents for use in combination therapy can be made 
by one of ordinary skill in the art, according to conventional pharmaceutical principles. 
The combination of therapeutic agents can act synergistically to effect the treatment or 
prevention of the various disorders described above. Using this approach, one may be 
able to achieve therapeutic efficacy with lower dosages of each agent, thus reducing the 

15 potential for adverse side effects. Any of the therapeutic methods described above can 
be applied to any subject in need of such therapy, including, for example, mammals such 
as dogs, cats, cows, horses, rabbits, monkeys, and most preferably, humans. 

Nucleic acid molecules of the invention are those nucleic acid molecules which are 
20 contained in a group of nucleic acid molecules consisting of (i) nucleic acid molecules 
encoding a polypeptide comprising the amino acid sequence of SEQ ID NO: 2, (ii) 
nucleic acid molecules comprising the sequence of SEQ ED NO: 1, (iii) nucleic acid 
molecules having the sequence of SEQ ID NO: 1, (iv)nucleic acid molecules the 
complementary strand of which hybridizes under stringent conditions to a nucleic acid 
25 molecule of (i), (ii), or (iii); and (v) nucleic acid molecules the sequence of which differs 
from the sequence of a nucleic acid molecule of (iii) due to the degeneracy of the genetic 
code, wherein the polypeptide encoded by said nucleic acid molecule has DPP4 activity. 

Polypeptides of the invention are those polypeptides which are contained in a group of 
30 polypeptides consisting of (i) polypeptides having the sequence of SEQ ID NO: 2, (ii) 
polypeptides comprising the sequence of SEQ ID NO: 2, (iii) polypeptides encoded by 
nucleic acid molecules of the invention and (iv) polypeptides which show at least 99%, 
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98%, 95%, 90%, or 80% homology with a polypeptide of (i), (ii), or (iii), wherein said 
purified polypeptide has DPP4 activity. 

An object of the invention is a method of screening for therapeutic agents useful in the 
5 treatment of a disease comprised in a group of diseases consisting of cardiovascular 
disorders, dermatological disorders, cancer disorders, hematological disorders, 
respiratory diseases, gastrointestinal and liver diseases, urological disorders and 
metabolic diseases in a mammal comprising the steps of (i) contacting a test compound 
with a DPP4 polypeptide, (ii) detect binding of said test compound to said DPP4 
10 polypeptide. E.g., compounds that bind to the DPP4 polypeptide are identified potential 
therapeutic agents for such a disease. 

Another object of the invention is a method of screening for therapeutic agents useful in 
the treatment of a disease comprised in a group of diseases consisting of cardiovascular 

15 disorders, dermatological disorders, cancer disorders, hematological disorders, 
respiratory diseases, gastrointestinal and liver diseases, urological disorders and 
metabolic diseases in a mammal comprising the steps of (i) determining the activity of a 
DPP4 polypeptide at a certain concentration of a test compound or in the absence of said 
test compound, (ii) determining the activity of said polypeptide at a different 

20 concentration of said test compound. E.g., compounds that lead to a difference in the 
activity of the DPP4 polypeptide in (i) and (ii) are identified potential therapeutic agents 
for such a disease. 

Another object of the invention is a method of screening for therapeutic agents useful in 
25 the treatment of a disease comprised in a group of diseases consisting of cardiovascular 
disorders, dermatological disorders, cancer disorders, hematological disorders, 
respiratory diseases, gastrointestinal and liver diseases, urological disorders and 
metabolic diseases in a mammal comprising the steps of (i) determining the activity of a 
DPP4 polypeptide at a certain concentration of a t est c ompound, (ii) determining the 
30 activity of a DPP4 polypeptide at the presence of a compound known to be a regulator of 
a DPP4 polypeptide. E.g., compounds that show similar effects on the activity of the 
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DPP4 polypeptide in (i) as compared to compounds used in (ii) are identified potential 
therapeutic agents for such a disease. 

Other objects of the invention are methods of the above, wherein the step of contacting 
5 is in or at the surface of a cell. 

Other objects of the invention are methods of the above, wherein the cell is in vitro. 

Other objects of the invention are methods of the above, wherein the step of contacting 
10 is in a cell-free system. 

Other objects of the invention are methods of the above, wherein the polypeptide is 
coupled to a detectable label. 

15 Other objects of the invention are methods of the above, wherein the compound is 
coupled to a detectable label. 

Other objects of the invention are methods of the above, wherein the test compound 
displaces a ligand which is first bound to the polypeptide. 

20 

Other objects of the invention are methods of the above, wherein the polypeptide is 
attached to a solid support. 

Other objects of the invention are methods of the above, wherein the compound is 
25 attached to a solid support. 

Another object of the invention is a method of screening for therapeutic agents useful in 
the treatment of a disease comprised in a group of diseases consisting of cardiovascular 
disorders, dermatological disorders, cancer disorders, hematological disorders, 
30 respiratory diseases, gastrointestinal and liver diseases, urological disorders and 
metabolic diseases in a mammal comprising the steps of (i) contacting a test compound 
with a DPP4 polynucleotide, (ii) detect binding of said test compound to said DPP4 
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polynucleotide. Compounds that, e .g., bind to the DPP4 polynucleotide are potential 
therapeutic agents for the treatment of such diseases. 

Another object of the invention is the method of the above, wherein the nucleic acid 
5 molecule is RNA. 

Another object of the invention is a method of the above, wherein the contacting step is 
in or at the surface of a cell. 

10 Another object of the invention is a method of the above, wherein the contacting step is 
in a cell-free system. 

Another object of the invention is a method of the above, wherein the polynucleotide is 
coupled to a detectable label. 

15 

Another object of the invention is a method of the above, wherein the test compound is 
coupled to a detectable label. 

Another object of the invention is a method of diagnosing a disease comprised in a 
20 group of diseases consisting of cardiovascular disorders, dermatological disorders, 
cancer disorders, hematological disorders, respiratory diseases, gastrointestinal and liver 
diseases, urological disorders and metabolic diseases in a mammal comprising the steps 
of(i) determining the amount of a DPP4 polynucleotide in a sample taken from said 
mammal, (ii) determining the amount of DPP4 polynucleotide in healthy and/or diseased 
25 mammal. A disease is diagnosed, e.g., if there is a substantial similarity in the amount 
of DPP4 polynucleotide in said test mammal as compared to a diseased mammal. 

Another object of the invention is a pharmaceutical composition for the treatment of a 
disease comprised in a group of diseases consisting of cardiovascular disorders, 
30 dermatological disorders, cancer disorders, hematological disorders, respiratory 
diseases, gastrointestinal and liver diseases, urological disorders and metabolic diseases 
in a mammal comprising a therapeutic agent which binds to a DPP4 polypeptide. 
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Another object of the invention is a pharmaceutical composition for the treatment of a 
disease comprised in a group of diseases consisting of cardiovascular disorders, 
dermatological disorders, cancer disorders, hematological disorders, respiratoiy 
5 diseases, gastrointestinal and liver diseases, urological disorders and metabolic diseases 
in a mammal comprising a therapeutic agent which regulates the activity of a DPP4 
polypeptide. 

Another object of the invention is a pharmaceutical composition for the treatment of a 
10 disease comprised in a group of diseases consisting of cardiovascular disorders, 
dermatological disorders, cancer disorders, hematological disorders, respiratory 
diseases, gastrointestinal and liver diseases, urological disorders and metabolic diseases 
in a mammal comprising a therapeutic agent which regulates the activity of a DPP4 
polypeptide, wherein said therapeutic agent is (i) a small molecule, (ii) an RNA 
15 molecule, (iii) an antisense oligonucleotide, (iv) a polypeptide, (v) an antibody, or (vi) a 
ribozyme. 

Another object of the invention is a pharmaceutical composition for the treatment of a 
disease comprised in a group of diseases consisting of cardiovascular disorders, 
20 dermatological disorders, cancer disorders, hematological disorders, respiratory 
diseases, gastrointestinal and liver diseases, urological disorders and metabolic diseases 
in a mammal comprising a DPP4 polynucleotide. 

Another object of the invention is a pharmaceutical composition for the treatment of a 
25 disease comprised in a group of diseases consisting of cardiovascular disorders, 
dermatological disorders, cancer disorders, hematological disorders, respiratory 
diseases, gastrointestinal and liver diseases, urological disorders and metabolic diseases, 
in a mammal comprising a DPP4 polypeptide. 

30 Another object of the invention is the use of regulators of a DPP4 for the preparation of 
a pharmaceutical composition for the treatment of a disease comprised in a group of 
diseases consisting of cardiovascular disorders, dermatological disorders, cancer 
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disorders, hematological disorders, respiratory diseases, gastrointestinal and liver 
diseases, urological disorders and metabolic diseases in a mammal. 

Another object of the invention is a method for the preparation of a pharmaceutical 
5 composition useful for the treatment of a disease comprised in a group of diseases 
consisting of cardiovascular disorders, deimatological disorders, cancer disorders, 
hematological disorders, respiratory diseases, gastrointestinal and liver diseases, 
urological disorders and metabolic diseases in a mammal comprising the steps of (i) 
identifying a regulator of DPP4, (ii) determining whether said regulator ameliorates the 
10 symptoms of a disease comprised in a group of diseases consisting of cardiovascular 
disorders, dermatological disorders, cancer disorders, hematological disorders, 
respiratory diseases, gastrointestinal and liver diseases, urological disorders and 
metabolic diseases in a mammal; and (iii) combining of said regulator with an 
acceptable pharmaceutical carrier. 

15 

Another object of the invention is the use of a regulator of DPP4 for the regulation of 
DPP4 activity in a mammal having a disease comprised in a group of diseases consisting 
of cardiovascular disorders, dermatological disorders, cancer disorders, hematological 
disorders, respiratory diseases, gastrointestinal and liver diseases, urological disorders 
20 and metabolic diseases. 

The examples below are provided to illustrate the subject invention. These examples are 
provided by way of illustration and are not included for the purpose of limiting the 
invention. 



25 
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Examples 

Example 1 : Search for homologous sequences in pu blic sequence data bases 

5 The degree of homology can readily be calculated by known methods. Preferred 
methods to determine homology are designed to give the largest match between the 
sequences tested. Methods to determine homology are codified in publicly available 
computer programs such as BestFit, BLASTP, BLASTN, and FASTA. The BLAST 
programs are publicly available from NCBI and other sources in the internet. 

10 

For DPP4 the following hits to known sequences were identified by using the BLAST 
algorithm [Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman 
DJ; Nucleic Acids Res 1997 Sep 1; 25(17): 3389-402] and the following set of 
parameters: matrix = BLOSUM62 and low complexity filter. The following databases 
15 were searched: NCBI (non-redundant database) and DERWENT patent database 
(Geneseq). 

The following hits were found: 

20 >gb|AAB44838.1| Sequence 3 from patent US 5587299 gb(AAE30606.1| Sequence 3 
from patent US 5965373 gb|AAE88377.1| Sequence 1 from patent US 6337069 Length 
= 766 Score = 1594 bits (4127), Expect = 0.0 Identities = 766/766 (100%), Positives = 
766/766 (100%) 

25 >ref|NP_00 1926.2| dipeptidylpeptidase IV; dipeptidylpeptidase IV (CD26, adenosine 
deaminase complexing protein 2); adenosine deaminase complexing protein 2; T-cell 
activation antigen CD26 [Homo sapiens] sp|P27487|DPP4_HUMAN Dipeptidyl 
peptidase IV (DPP IV) (T-cell activation antigen CD26) (TP103) (Adenosine deaminase 
complexing protein-2) (ADABP) gb|AAA51943.1| dipeptidyl peptidase IV 

30 gb|AAB60646.1| dipeptidyl peptidase IV [Homo sapiens] gbjAAHl 3329. 1|AAH 13329 
dipeptidylpeptidase IV (CD26, adenosine deaminase complexing protein 2) [Homo 



WO 2004/104216 



PCT/EP2004/005071 



-91 - 

sapiens] Length = 766 Score = 1594 bits (4127), Expect = 0.0 Identities = 766/766 
(100%), Positives = 766/766 (100%) 

>pir||CDHU26 dipeptidyl-peptidase IV (EC 3.4.14.5) - human Length = 766 Score = 
5 1 592 bits (4121), Expect = 0.0 Identities = 765/766 (99%), Positives = 765/766 (99%) 

>emb|CAA43 118.1| dipeptidyl peptidase iv [Homo sapiens] Length = 766 Score = 1591 
bits (4120), Expect = 0.0 Identities - 764/766 (99%), Positives = 765/766 (99%) 

10 >gb|AAA52308.1| dipeptidyl peptidase IV Length = 766 Score = 1589 bits (4114), 
Expect = 0.0 Identities = 763/766 (99%), Positives = 765/766 (99%) 

>pdb|lNlM|A Chain A, Human Dipeptidyl Peptidase IvCD26 IN COMPLEX WITH 
AN Inhibitor pdb|lNlM|B Chain B, Human Dipeptidyl Peptidase IvCD26 IN 
15 COMPLEX WITH AN Inhibitor Length = 728 Score = 1521 bits (3939), Expect = 0.0 
Identities = 728/728 (100%), Positives = 728/728 (100%) 

>ref|NP_776464.1| dipepndylpeptidase IV (CD26, adenosine deaminase complexing 
protein 2) [Bos taurus] sp|P81425|DPP4_BOVIN Dipeptidyl peptidase IV (DPP IV) (T- 
20 cell activation antigen CD26) (Adenosine deaminase complexing protein) (ADCP-I) 
(Activation molecule 3) (ACT3) (WC10) gb|AAL67836.1|AF461806_l dipeptidyl 
peptidase IV [Bos taurus] gb|AAL23628.1| dipeptidyl peptidase IV [Bos taurus] Length 
= 765 Score = 1439 bits (3725), Expect = 0.0 Identities = 679/765 (88%), Positives = 
723/765 (93%), Gaps = 1/765 (0%) 

25 

>gb|AAO43404.1| dipeptidyl peptidase IV [Sus scrofa] Length = 766 Score = 1439 bits 
(3725), Expect = 0.0 Identities = 677/766 (88%), Positives = 724/766 (94%) 

>sp|Q9N2I7|DPP4_FELCA Dipeptidyl peptidase IV (DPP IV) (T-cell activation antigen 
30 CD26) dbj|BAA92344.1| dipeptidyl peptidase IV [Felis catus] Length = 765 Score = 
1435 bits (3715), Expect = 0.0 Identities - 674/766 (87%), Positives = 723/766 (93%), 
Gaps =1/766(0%) 
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>pir||A39914 dipeptidyl-peptidase IV (EC 3.4.14.5), membrane-bound form precursor - 
rat gb|AAA41272.1| bile canaliculus-specific membrane glycoprotein precursor Length 
= 792 Score = 1374 bits (3556), Expect = 0.0 Identities = 650/768 (84%), Positives = 
5 705/768 (91%), Gaps = 5/768 (0%) 

>ref|NP_036921.1| dipeptidyl peptidase 4 [Rattus norvegicus] sp|P14740|DPP4_RAT 
Dipeptidyl peptidase IV (DPP IV) (T-cell activation antigen CD26) (GP110 
glycoprotein) (Bile canaliculus domain-specific membrane glycoprotein) 
10 gb|AAA41096.1| dipeptidyl peptidase IV (EC 3.4.14.5) Length = 767 Score = 1372 bits 
(3551), Expect = 0.0 Identities = 650/768 (84%), Positives = 704/768 (91%), Gaps = 
5/768 (0%) 

>ref]NP_034204.1| dipeptidylpeptidase 4 [Mus musculus] sp|P28843|DPP4_MOUSE 
15 Dipeptidyl peptidase IV (DPP IV) (T-cell activation antigen CD26) (Thymocyte- 
activating molecule) (THAM) emb|CAA41274.1| dipeptidyl peptidase iv [Mus 
musculus] gb|AAA82213.1| dipeptidyl peptidase IV gb|AAH22183.1| 
dipeptidylpeptidase 4 [Mus musculus] dbj|BAC39434.1| unnamed protein product [Mus 
musculus] Length = 760 Score = 1365 bits (3532), Expect = 0.0 Identities = 647/765 
20 (84%), Positives = 707/765 (91%), Gaps = 6/765 (0%) 

>pir||S23752 dipeptidyl-peptidase IV (EC 3.4.14.5) alpha chain - mouse Length = 760 
Score = 1363 bits (3528), Expect = 0.0 Identities = 646/765 (84%), Positives = 706/765 
(91%), Gaps = 6/765(0%) 

25 

>emb|CAA70136.1| dipeptidyl-peptidase IV [Xenopus laevis] Length = 748 Score = 916 
bits (2367), Expect = 0.0 Identities = 443/766 (57%), Positives = 558/766 (72%), Gaps = 
20/766 (2%) 

30 >ref|NP_00445 1 .2| fibroblast activation protein, alpha subunit; integral membrane serine 
protease; seprase [Homo sapiens] gb|AAC51668.1| integral membrane serine protease 
Seprase [Homo sapiens] gb|AAH26250.1| fibroblast activation protein, alpha [Homo 
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sapiens] Length = 760 Score = 843 bits (2178), Expect = 0.0 Identities = 395/767 (51%), 
Positives - 544/767 (70%), Gaps = 11/767 (1%) 

Example 2; Expression profiling 

5 

Total cellular RNA was isolated from cells by one of two standard methods: 1) 
guanidine isothiocyanate/Cesium chloride density gradient centrifiigation [Kellogg, 
(1990)]; or with the Tri-Reagent protocol according to the manufacturer's specifications 
(Molecular Research Center, Inc., Cincinatti, Ohio). Total RNA prepared by the Tri- 
10 reagent protocol was treated with DNAse I to remove genomic DNA contamination. 

For relative quantitation of the mRNA distribution of DPP4, total RNA from each cell or 
tissue source was first reverse transcribed. 85 \ig of total RNA was reverse transcribed 
using 1 fimole random hexamer primers, 0.5 mM each of dATP, dCTP, dGTP and dTTP 
1 5 (Qiagen, Hilden, Germany), 3000 U RnaseQut (Invitrogen, Groningen, Netherlands) in a 
final volume of 680 pi. The first strand synthesis buffer and Omniscript reverse 
transcriptase (2 u/pl) were from (Qiagen, Hilden, Germany). The reaction was 
incubated at 37°C for 90 minutes and cooled on ice. The volume was adjusted to 6800 
pi with water, yielding a final concentration of 12.5 ng/pl of starting RNA. 

20 

For relative quantitation of the distribution of DPP4 mRNA in cells and tissues the 
Perkin Elmer ABI Prism RTM. 7700 Sequence Detection system or Biorad iCycler was 
used according to the manufacturer's specifications and protocols. PCR reactions were 
set up to quantitate DPP4 and the housekeeping genes HPRT (hypoxanthine 

25 phosphoribosyltransferase), GAPDH (glyceraldehyde-3-phosphate dehydrogenase), P- 
actin, and others. Forward and reverse primers and probes for DPP4 were designed 
using the Perkin Elmer ABI Primer Express™ software and were synthesized by 
TibMolBiol (Berlin, Germany). The DPP4 forward primer sequence was: Primerl (SEQ 
ID NO: 3). The DPP4 reverse primer sequence was Primer2 (SEQ ID NO: 4). Probel 

30 (SEQ ID NO: 5), labelled with FAM (carboxyfluorescein succinimidyl ester) as the 
reporter dye and TAMRA (carboxytetramethylrhodamine) as the quencher, is used as a 
probe for DPP4. The following reagents were prepared in a total of 25 p-1 : lx TaqMan 
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buffer A, 5.5 mM MgCl 2 , 200 nM of dATP, dCTP, dGTP, and dUTP, 0.025 U/jil 
AmpliTaq Gold™, 0.01 U/ jil AmpErase and Probel (SEQ ID NO: 4), DPP4 forward 
and reverse primers each at 200 nM, 200 nM DPP4 FAM/TAMRA-labelled probe, and 5 
Hi of template cDNA. Thermal cycling parameters were 2 min at 50°C, followed by 10 
5 min at 95°C, followed by 40 cycles of melting at 95°C for 15 sec and 
annealing/extending at 60°C foT 1 min. 

Calculation of corrected CT values 

10 The CT (threshold cycle) value is calculated as described in the "Quantitative 
determination of nucleic acids" section. The CF-value (factor for threshold cycle 
correction) is calculated as follows : 

1. PCR reactions were set up to quantitate the housekeeping genes (HKG) for each 
15 cDNA sample. 

2. CTHKG-values (threshold cycle for housekeeping gene) were calculated as 
described in the "Quantitative determination of nucleic acids" section. 

20 3. CTHKG-mean values (CT mean value of all HKG tested on one cDNAs) of all 
HKG for each cDNA are calculated (n = number of HKG): 

CTHKG-n-mean value = (CTHKGrvalue + CTHKG2-value +... + CTHKG-n-value) / n 



25 4. CTpannd mean value (CT mean value of all HKG in all tested cDNAs) = 

(CT H jcGi-niean value + CTHKG2-mean value +...+ CTHKG-y-mean value) / y 
(y = number of cDNAs) 

30 5. CFcdna-o (correction factor for cDNA n) = CTpannei-mean value - CTHKG-n-mean 
value 
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6. CT C DNA-n (CT value of the tested gene for the cDNA n) + CF cD NA-n (correction 
factor for cDNA n) = CTcot^dna-o (corrected CT value for a gene on cDNA n) 

Calculation of relative expression 

5 

Definition : highest CT e0 r<DNA-n * 40 is defined as CTcor-cDNA P»gh] 

Relative Expression = 



Tissues 

The expression of DPP4 was investigated in the tissues in table 1. 



Expression profile 



1 5 The results of the the mRNA-quantification (expression profiling) is shown in Table 1 . 



Table 1: Relative expression of DPP4 in various human tissues. 



Tissue 
Expression 
20 fetal heart 
heart 

pericardium 

heart atrium (right) 

heart atrium (left) 
25 heart ventricle (left) 

heart ventricle (right) 

heart apex 

Purkinje fibers 

interventricular septum 
30 fetal aorta 

aorta 

artery 

coronary artery 
pulmonary artery 



Relative 

18179 
1418 
2557 
3848 
355 
771 
35 
434 
549 
3692 
72 
109 
9 
16 
8 
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carotid artery 
mesenteric artery 
vein 

pulmonic valve 



29 
25 
4 

156 



coronary artery smooth muscle primary cells 5257 
HUVEC cells 2574 



skin 



25709 



10 adrenal gland 
thyroid 
thyroid tumor 
pancreas 

pancreas liver cirrhosis 

15 

esophagus 
esophagus tumor 
stomach 
stomach tumor 
20 colon 

colon tumor 
small intestine 
ileum 

ileum tumor 
25 ileum chronic inflammation 
rectum 

salivary gland 
fetal liver 
liver 

30 liver liver cirrhosis 
liver tumor 
HEP G2 cells 



1833 
357 
236 
605 
224 

133 
3169 
1833 
14165 
3104 
955 
30153 
437827 
657 
8249 
7281 
130167 
2539 
.17199 
1978 
42939 
249728 



leukocytes (peripheral blood) 3468 
35 Jurkat (T-cells) 0 
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10 



15 



bone marrow 
erythrocytes 
lymphnode 
thymus 

thrombocytes 

bone marrow stromal cells 
bone marrow CD71+ cells 
bone marrow CD33+ cells 
bone marrow CD34+ cells 
bone marrow CD15+ cells 
cord blood CD71+ cells 
cord blood CD34+ cells 
neutrophils cord blood 
neutrophils peripheral blood 
spleen 

spleen liver cirrhosis 



249 
434 
176 
2817 
106 
3692 
0 
0 
78 
3 
0 

161 
832 
85 
5149 
1075 



20 



25 



30 



35 



skeletal muscle 
adipose 

fetal brain 
brain 

alzheimer brain 
cerebellum 
cerebellum (right) 
cerebellum (left) 
cerebral cortex 
alzheimer cerebral cortex 
frontal lobe 

alzheimer brain frontal lobe 
occipital lobe 
parietal lobe 
temporal lobe 
precentral gyrus 
postcentral gyrus 



942 
28526 

383 
639 
734 
6 
91 
161 
622 
329 
1820 
1479 
617 
989 
564 
1409 
14 
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tonsilla cerebelli 
vermis cerebelli 
pons 

substantia nigra 
5 cerebral meninges 

cerebral peduncles 

corpus callosum 

hippocampus 

thalamus 
10 dorsal root ganglia 

spinal cord 

neuroblastoma SK-N-MC cells 
neuroblastoma SH-SY5Y cells 
neuroblastoma IMR32 cells 
15 glial tumor H4 cells 

glial tumor H4 cells + APP 

HEK CNS 

HEK CNS + APP 

retina 

20 



fetal lung 9345 

fetal lung fibroblast IMR-90 cells 16845 

fetal lung fibroblast MRC-5 cells 28329 

lung 431 

25 lung right upper lobe 1342 

lung right mid lobe 765 

lung right lower lobe 1380 

lung lupus disease 11426 

lung tumor 112 

30 lung COPD 488 

trachea 2721 

cervix 326 

testis 592 

35 HeLa cells (cervix tumor) 0 



33 
26 
676 
2557 
42 
29 
564 
809 
139 
35 
609 
910 
15 
0 
7 
1 

495 
9675 
56 



WO 2004/104216 



PCT/EP2004/005071 



-99- 

placenta 
uterus 

uterus tumor 
ovary 
5 ovary tumor 
breast 

breast tumor 
MDA MB 231 cells 
mammary gland 

10 



prostata 


62866 


prostate BPH 


1629 


bladder 


1342 


ureter 


630 


penis 


94 


corpus cavernosum 


73 


fetal kidney 


81811 


kidney 


71220 


kidney tumor 


8719 


HEK 293 cells 


4096 



Example 3: Antisense Analysis 

Knowledge of the correct, complete cDNA sequence coding for DPP4 enables its use as 
25 a tool for antisense technology in the investigation of gene function. Oligonucleotides, 
cDNA or genomic fragments comprising the antisense strand of a polynucleotide coding 
for DPP4 are used either in vitro or in vivo to inhibit translation of the mRNA. Such 
technology is now well known in the art, and antisense molecules can be designed at 
various locations along the nucleotide sequences. By treatment of cells or whole test 
30 animals with such antisense sequences, the gene of interest is effectively turned off. 
Frequently, the function of the gene is ascertained by observing behavior at the 
intracellular, cellular, tissue or organismal level (e.g., lethality, loss of differentiated 
function, changes in morphology, etc.). 



13484 

2837 
23494 
2721 
237 
7434 
1184 

(breast tumor) 37 

2195 
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In addition to using sequences constructed to interrupt transcription of a particular open 
reading frame, modifications of gene expression is obtained by designing antisense 
sequences to intron regions, promoter/enhancer elements, or even to trans-acting 
regulatory genes. 

5 

Example 4: Expression of DPP4 

Expression of DPP4 is accomplished by subcloning the cDNAs into appropriate 
expression vectors and transfecting the vectors into expression hosts such as, e.g., is . 

10 coli. In a particular case, the vector is engineered such that it contains a promoter for P- 
galactosidase, upstream of the cloning site, followed by sequence containing the amino- 
terminal Methionine and the subsequent seven residues of p-galactosidase. Immediately 
following these eight residues is an engineered bacteriophage promoter useful for 
artificial priming and transcription and for providing a number of unique endonuclease 

1 5 restriction sites for cloning. 

Induction of the isolated, transfected bacterial strain with Isopropyl-p-D-thio- 
galactopyranoside (DPTG) using standard methods produces a fusion protein 
corresponding to the first seven residues of P-galactosidase, about 15 residues of 

20 "linker", and the peptide encoded within the cDNA. Since cDNA clone inserts are 
generated by an essentially random process, there is probability of 33% that the included 
cDNA will lie in the correct reading frame for proper translation. If the cDNA is not in 
the proper reading frame, it is obtained by deletion or insertion of the appropriate 
number of bases using well known methods including in vitro mutagenesis, digestion 

25 with exonuclease HI or mung bean nuclease, or the inclusion of an oligonucleotide 
linker of appropriate length. 

The DPP4 cDNA is shuttled into other vectors known to be useful for expression of 
proteins in specific hosts. Oligonucleotide primers containing cloning sites as well as a 
30 segment of DNA (about 25 bases) sufficient to hybridize to stretches at both ends of the 
target cDNA is synthesized chemically by standard methods. These primers are then 
used to amplify the desired gene segment by PCR. The resulting gene segment is 
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digested with appropriate restriction enzymes under standard conditions and isolated by 
gel electrophoresis. Alternately, similar gene segments are produced by digestion of the 
cDNA with appropriate restriction enzymes. Using appropriate primers, segments of 
coding sequence from more than one gene are ligated together and cloned in appropriate 
5 vectors. It is possible to optimize expression by construction of such chimeric 
sequences. 

Suitable expression hosts for such chimeric molecules include, but are not limited to, 
mammalian cells such as Chinese Hamster Ovary (CHO) and human 293 cells., insect 

10 cells such as Sf9 cells, yeast cells such as Saccharomyces cerevisiae and bacterial cells 
such as E. colL For each of these cell systems, a useful expression vector also includes 
an origin of replication to allow propagation in bacteria, and a selectable marker such as 
the p-lactamase antibiotic resistance gene to allow plasmid selection in bacteria. In 
addition, the vector may include a second selectable marker such as the neomycin 

15 phosphotransferase gene to allow selection in transfected eukaiyotic host cells. Vectors 
for use in eukaryotic expression hosts require RNA processing elements such as 3* 
polyadenylation sequences if such are not part of the cDNA of interest. 

Additionally, the vector contains promoters or enhancers which increase gene 
20 expression. Such promoters are host specific and include MMTV, SV40, and 
metallothionine promoters for CHO cells; trp, lac, tac and T7 promoters for bacterial 
hosts; and alpha factor, alcohol oxidase and PGH promoters for yeast. T ranscription 
enhancers, such as the rous sarcoma virus enhancer, are used in mammalian host cells. 
Once homogeneous cultures of recombinant cells are obtained through standard culture 
25 methods, large quantities of recombinantly produced DPP4 are recovered from the 
conditioned medium and analyzed using chromatographic methods known in the art. 
For example, DPP4 can be cloned into the expression vector pcDNA3, as exemplified 
herein. This product can be used to transform, for example, HEK293 or COS by 
methodology standard in the art. Specifically, for example, using Lipofectamine (Gibco 
30 BRL catolog no. 1 8324-020) mediated gene transfer. 
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Example 5: Isolation of Recombinant DPP4 

DPP4 is expressed as a chimeric protein with one or more additional polypeptide 
domains added to facilitate protein purification. Such purification facilitating domains 

5 include, but are not limited to, metal chelating peptides such as histidine-tryptophan 
. modules that allow purification on immobilized metals [Appa Rao, 1997] and the 
domain utilized in the FLAGS extension/affinity purification system (Immunex Corp., 
Seattle, Washington). The inclusion of a cleavable linker sequence such as Factor Xa or 
enterokinase (Invitrogen, Groningen, The Netherlands) between the purification domain 

10 and the DPP4 sequence is useful to facilitate expression of DPP4. 

The following example provides a method for purifying DPP4. 

DPP4 is generated using the baculovirus expression system BAC-TO-BAC (GIBCO 
15 BRL) based on Autographa calif ornica nuclear polyhedrosis virus (AcNPV) infection of 
Spodoptera Jrugiperda insect cells (Sf9 cells). 

cDNA encoding proteases cloned into either the donor plasmid pFASTBACl or 
pFASTBAC-HT which contain a mini-Tn7 transposition element. The recombinant 

20 plasmid is transformed into DH 1 0B AC competent cells which contain the parent bacmid 
bMON14272 (AcNPV infectious DNA) and a helper plasmid. The mini-Tn7 element on 
the pFASTBAC donor can transpose to the attTn7 attachment site on the bacmid thus 
introducing the protease gene into the viral genome. Colonies containing recombinant 
bacmids are identified by d isruption of the lacZ gene. The protease/bacmid construct 

25 can then be isolated and infected into insect cells (Sf9 cells) resulting in the production 
of infectious recombinant baculovirus particles and expression of either unfused 
recombinant enzyme (pFastbacl) or DPP4-His fusion protein (pFastbacHT). 

Cells are harvested and extracts prepared 24, 48 and 72 hours after transfection. 
30 Expression of DPP4 is confirmed by coomassie staining after sodium dodecyl sulphate- 
polyacrylamide gel electrophoresis (SDS-PAGE) and western blotting onto a PVDF 
membrane of an unstained SDS-PAGE. The protease-His fusion protein is detected due 
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to the interaction between the Ni-NTA HRP conjugate and the His-tag which is fused to 
DPP4. 

Example 6: Production of DPP4 Specific Antibodies 

5 

Two approaches are utilized to raise antibodies to DPP4, and each approach is useful for 
generating either polyclonal or monoclonal antibodies. In one approach, denatured 
protein from reverse phase HPLC separation is obtained in quantities up to 75 mg. This 
denatured protein is used to immunize mice or rabbits using standard protocols; about 
10 100 \x% are adequate for immunization of a mouse, while up to 1 mg might be used to 
immunize a rabbit. For identifying mouse hybridomas, the denatured protein is 
radioiodinated and used to screen potential murine B-cell hybridomas for those which 
produce antibody. This procedure requires only small quantities of protein, such that 20 
mg is sufficient for labeling and screening of several thousand clones. 

15 

In the second approach, the amino acid sequence of an appropriate DPP4 domain, as 
deduced from translation of the cDNA, is analyzed to determine regions of high 
antigenicity. Oligopeptides comprising appropriate hydrophilic regions are synthesized 
and used in suitable immunization protocols to raise antibodies. The optimal amino acid 
20 sequences for immunization are usually at the C-terminus, the N-terminus and those 
intervening, hydrophilic regions of the polypeptide which are likely to be exposed to the 
external environment when the protein is in its natural conformation. 

Typically, selected peptides, about 15 residues in length, are synthesized using an 
25 Applied Biosystems Peptide Synthesizer Model 43 1 A using finoc-chemistry and coupled 
to keyhole limpet hemocyanin (KLH; Sigma, St. Louis, MO) by reaction with M- 
maleimidobenzoyl-N-hydroxysuccinimide ester, MBS. If necessary, a cysteine is 
introduced at the N-terminus of the peptide to permit coupling to KLH. R abbits are 
immunized with the peptide-KLH complex in complete Freund's adjuvant. The resulting 
30 antisera are tested for antipeptide activity by binding the peptide to plastic, blocking 
with 1% bovine serum albumin, reacting with antisera, washing and reacting with 
labeled (radioactive or fluorescent), affinity purified, specific goat anti-rabbit IgG. 
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Hybridomas are prepared and screened using standard techniques. Hybridomas of 
interest are detected by screening with labeled DPP4 to identify those fusions producing 
the monoclonal antibody with the desired specificity. In a typical protocol, wells of 

5 plates (FAST; Becton-Dickinson, Palo Alto, CA) are coated during incubation with 
affinity purified, specific rabbit anti-mouse (or suitable antispecies 1 g) antibodies at 10 
mg/ml. The coated wells are blocked with 1% bovine serum albumin, (BSA), washed 
and incubated with supernatants from hybridomas. After washing the wells are 
incubated with labeled DPP4 at 1 mg/ml. Supernatants with specific antibodies bind 

10 more labeled DPP4 than is detectable in the background. Then clones producing 
specific antibodies are expanded and subjected to two cycles of cloning at limiting 
dilution. Cloned hybridomas are injected into pristane- treated mice to produce ascites, 
and monoclonal antibody is purified from mouse ascitic fluid by affinity 
chromatography on Protein A. Monoclonal antibodies with affinities of at least 

15 

10 8 M"\ preferably 10 9 to 10 10 M* 1 or stronger, are typically made by standard 
procedures. 

Example 7: Diagnostic Test Usinjg DPP4 Specific Antibodies 

20 

Particular DPP4 antibodies are useful for investigating signal transduction and the 
diagnosis of infectious or hereditary conditions which are characterized by differences in 
the amount or distribution of DPP4 or downstream products of an active signaling 
cascade. 

25 

Diagnostic tests for DPP4 include methods utilizing antibody and a label to detect DPP4 
in human body fluids, membranes, cells, tissues or extracts of such. The polypeptides 
and antibodies of the present invention are used with or without modification. 
Frequently, the polypeptides and antibodies are labeled by joining them, either 
30 covalently or noncovalently, with a substance which provides for a detectable signal, A 
wide variety of labels and conjugation techniques are known and have been reported 
extensively in both the scientific and patent literature. Suitable labels include 
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radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent agents, 
chemiluminescent agents, chromogenic agents, magnetic particles and the like. 

A variety of protocols for measuring soluble or membrane-bound DPP4, using either 
5 polyclonal or monoclonal antibodies specific for the protein, are known in the art. 
Examples include enzyme-linked immunosorbent assay (ELISA), radioimmunoassay 
(RIA) and fluorescent activated cell sorting (FACS). A two-site monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering epitopes 
on DPP4 is preferred, but a competitive binding assay may be employed. 

10 

Example 8: Purification of Native DPP4 Using Specific Antibodies 

Native or recombinant DPP4 is purified by immunoaffinity chromatography using 
antibodies specific for DPP4. In general, an immunoaffinity column is constructed by 
1 5 covalently coupling the anti-TRH antibody to an activated chromatographic resin. 

Polyclonal immunoglobulins are prepared from immune sera either by precipitation with 
ammonium sulfate or by purification on immobilized Protein A (Pharmacia LKB Bio- 
technology, Piscataway N.J.). Likewise, monoclonal antibodies are prepared from 
20 mouse ascites fluid by ammonium sulfate precipitation or chromatography on 
immobilized Protein A. Partially purified immunoglobulin is covalently attached to a 
chromatographic resin such as CnBr-activated Sepharose (Pharmacia LKB 
Biotechnology). The antibody is coupled to the resin, the resin is blocked, and the 
derivative resin is washed according to the manufacturers instructions. 

25 

Such immunoaffinity columns are utilized in the purification of DPP4 by preparing a 
fraction from cells containing DPP4 in a soluble form. This preparation is derived by 
solubilization of whole cells or of a subcellular fraction obtained via differential 
centrifugation (with or without addition of detergent) or by other methods well known in 
30 the art. Alternatively, soluble DPP4 containing a signal sequence is secreted in useful 
quantity into the medium in which the cells are grown. 
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A soluble DPP4-containing preparation is passed over the immunoaffinity column, and 
the column is washed under conditions that allow the preferential absorbance of DPP4 
(e.g., high ionic strength buffers in the presence of detergent). Then, the column is 
eluted under conditions that disrupt antibody/protein binding (e.g., a buffer of pH 2-3 or 
5 a high concentration of a chaotrope such as urea or thiocyanate ion), and DPP4 is 
collected 

Example 9: Drug Screening 

10 This invention is particularly useful for screening therapeutic compounds by using DPP4 
or fragments thereof in any of a variety of drug screening techniques. 

The following example provides a system for drug screening measuring the protease 
activity. 

15 

The recombinant protease-His fusion protein can be purified from the crude lysate by 
metal-affinity chromatography using Ni-NTA agarose. This allows the specific retention 
of the recombinant material (since this is fused to the His-tag) whilst the endogenous 
insect proteins are washed off. The recombinant material is then eluted by competition 
20 with imidazol. 

The activity of DPP4 molecules of the present invention can be measured using a variety 
of assays that measure DPP4 activity. For example, DPP4 enzyme activity can be 
assessed by a standard in vitro serine/metallo/... protease assay (see, for example, [U.S. 

25 5,057,414]). Those of skill in the art are aware of a variety of substrates suitable for in 
vitro assays, such as SucAla-Ala-Pro-Phe-pNA, fluorescein mono-p-guanidinobenzoate 
hydrochloride, benzyloxycarbonyl-L-Arginyl-S-benzylester, Nalpha-Benzoyl-L-arginine 
ethyl ester hydrochloride, and the like. In addition, protease assay kits available from 
commercial sources, such as Calbiochem™ (San Diego, Calif.). For general references, 

30 see B arrett (Ed.), M ethods i n E nzymology, P roteolytic Enzymes: Serine and Cysteine 
Peptidases (Academic Press Inc. 1994), and Barrett et al., (Eds.), Handbook of 
Proteolytic Enzymes (Academic Press Inc. 1998). 
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Example 10: Rational Drug Design 

The goal of rational drug design is to produce structural analogs of biologically active 
5 polypeptides of interest or of small molecules with which they interact, agonists, 
antagonists, or inhibitors. Any of these examples are used to fashion drugs which are 
more active or stable forms of the polypeptide or which enhance or interfere with the 
function of a polypeptide in vivo. 

10 In one approach, the three-dimensional structure of a protein of interest, or of a protein- 
inhibitor complex, is determined by x-ray crystallography, by computer modeling or, 
most typically, by a combination of the two approaches. Both the shape and charges of 
the polypeptide must be ascertained to elucidate the structure and to determine active 
site(s) of the molecule. Less often, useful information regarding the structure of a 

15 polypeptide is gained by modeling based on the structure of homologous proteins. In 
both cases, relevant structural information is used to design efficient inhibitors. Useful 
examples of rational drug design include molecules which have improved activity or 
stability or which act as inhibitors, agonists, or antagonists of native peptides. 

20 It is also possible to isolate a target-specific antibody, selected by functional assay, as 
described above, and then to solve its crystal structure. This approach, in principle, 
yields a pharmacore upon which subsequent drug design is based. It is possible to 
bypass protein crystallography altogether by generating anti-idiotypic antibodies (anti- 
ids) to a functional, pharmacologically active antibody. As a mirror image of a mirror 

25 image, the binding site of the anti-ids is expected to be an analog of the original 
receptor. The anti-id is then used to identify and isolate peptides from banks of 
chemically or biologically produced peptides. The isolated peptides then act as the 
pharmacore. 

30 By virtue of the present invention, sufficient amount of polypeptide are made available 
to perform such analytical studies as X-ray crystallography. In addition, knowledge of 
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the DPP4 amino acid sequence provided herein provides guidance to those employing 
computer modeling techniques in place of or in addition to x-ray crystallography. - 

Example lit Identification of Other Members of the Signal Transduction Complex 

5 

Labeled DPP4 is useful as a reagent for the purification of molecules with which it 
interacts. In one embodiment of affinity purification, DPP4 is covalently coupled to a 
chromatography column. Cell-free extract derived from synovial cells or putative target 
cells is passed over the column, and molecules with appropriate affinity bind to DPP4. 
10 DPP4-complex is recovered from the column, and the DPP4-binding ligand 
disassociated and subjected to N-terminal protein sequencing. The amino acid sequence 
information is then used to identify the captured molecule or to design degenerate 
oligonucleotide probes for cloning the relevant gene from an appropriate cDNA library. 

15 In an alternate method, antibodies are raised against DPP4, specifically monoclonal 
antibodies. The monoclonal antibodies are screened to identify those which inhibit the. 
binding of labeled DPP4. These monoclonal antibodies are then used therapeutically. 

Example 12: Use and Administration of Antibodies. Inhibitors, or Antagonists 

20 

Antibodies, inhibitors, or antagonists of DPP4 or other treatments and compunds that are 
limiters of signal transduction (LSTs), provide different effects when administered 
therapeutically. LSTs are formulated in a nontoxic, inert, pharmaceutical^ acceptable 
aqueous carrier medium preferably at a pH of about 5 to 8, more preferably 6 to 8, 

25 although pH may vary according to the characteristics of the antibody, inhibitor, or 
antagonist being formulated and the condition to be treated. C haracteristics of LSTs 
include solubility of the molecule, its half-life and antigenicity/immunogenicity. These 
and other characteristics aid in defining an effective carrier. Native human proteins are 
preferred as LSTs, but organic or synthetic molecules resulting from drug screens are 

30 equally effective in particular situations. 
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LSTs are delivered by known routes of administration including but not limited to 
topical creams and gels; transmucosal spray and aerosol; transdermal patch and 
bandage; injectable, intravenous and lavage formulations; and orally administered 
liquids and pills particularly formulated to resist stomach acid and enzymes. The 
5 particular formulation, exact dosage, and route of administration is determined by the 
attending physician and varies according to each specific situation. 

Such determinations are made by considering multiple variables such as the condition to 
be treated, the LST to be administered, and the pharmacokinetic profile of a particular 

10 LST. A dditional factors which are taken into account include severity of the disease 
state, patient's age, weight, gender and diet, time and frequency of LST administration, 
possible combination with other drugs, reaction sensitivities, and tolerance/response to 
therapy. Long acting LST formulations might be administered every 3 to 4 days, every 
week, or once every two weeks depending on half-life and clearance rate of the 

15 particular LST. 

Normal dosage amounts vary from 0.1 to 10 5 ^g, up to a total dose of about 1 g, 
depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature; see U.S. Pat. Nos. 4,657,760; 
20 5,206,344; or 5,225,212. Those skilled in the art employ different formulations for 
different LSTs. Administration to c ells s uch a s n erve c ells n ecessitates d elivery i n a 
manner different from that to other cells such as vascular endothelial cells. 

It is contemplated that abnormal signal transduction, trauma, or diseases which trigger 
25 DPP4 activity are treatable with LSTs. These conditions or diseases are specifically 
diagnosed by the tests discussed above, and such testing should be performed in 
suspected cases of viral, b acterial or fungal infections, allergic responses, mechanical 
injury associated with trauma, hereditary diseases, lymphoma or carcinoma, or other 
conditions which activate the genes of lymphoid or neuronal tissues. 



30 
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Example 13: Production of Non-human Transgenic Animals 

Animal model systems which elucidate the physiological and behavioral roles of the 
DPP4 are produced by creating nonhuman transgenic animals in which the activity of 

5 the DPP4 is either increased or decreased, or the amino acid sequence of the expressed 
PPP4 is altered, by a variety of techniques. Examples of these techniques include, but 
are not limited to: 1) Insertion of normal or mutant versions of DNA encoding a DPP4, 
by microinjection, electroporation, retroviral transfection or other means well known to 
those skilled in the art, into appropriately fertilized embryos in order to produce a 

10 transgenic animal or 2) homologous recombination of mutant or normal, human or 
animal versions of these genes with the native gene locus in transgenic animals to alter 
the regulation of expression or the structure of these DPP4 sequences. The technique of 
homologous recombination is well known in the art. It replaces the native gene with the 
inserted gene and hence is useful for producing an animal that cannot express native 

15 DPP4s but does express, for example, an inserted mutant DPP4, which has replaced the 
native DPP4 in the animal's genome by recombination, resulting in underexpression of 
the transporter. Microinjection adds genes to the genome, but does not remove them, 
and the technique is useful for producing an animal which expresses its own and added 
DPP4, resulting in overexpression of the DPP4. 

20 

One means available for producing a transgenic animal, with a mouse as an example, is 
as follows: Female mice are mated, and the resulting fertilized eggs are dissected out of 
their oviducts. The eggs are stored in an appropriate medium such as cesiumchloride 
M2 medium. DNA or cDNA encoding DPP4 is purified from a vector by methods well 

25 known to the one skilled in the art. Inducible promoters may be fused with the coding 
region of the DNA to provide an experimental means to regulate expression of the 
transgene. Alternatively or in addition, tissue specific regulatory elements may be fused 
with the coding region to permit tissue-specific expression of the transgene. The DNA, 
in an appropriately buffered solution, is put into a microinjection needle (which may be 

30 made from capillary tubing using a piper puller) and the egg to be injected is put in a 
depression slide. The needle is inserted into the pronucleus of the egg, and the DNA 
solution is injected. The injected egg is then transferred into the oviduct of a 
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pseudopregnant mouse which is a mouse stimulated by the appropriate hormones in 
order to maintain false pregnancy, where it proceeds to the uterus, implants, and 
develops to term. As noted above, microinjection is not the only method for inserting 
DNA into the egg but is used here only for exemplary purposes. 

5 
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Claims 

1. A method of screening for therapeutic agents useful in the treatment of a disease 
comprised in a group of diseases consisting of cardiovascular disorders, 
dermatological disorders, cancer disorders, hematological disorders, respiratory 
diseases, gastrointestinal and liver diseases, urological disorders and metabolic 
diseases in a mammal comprising the steps of 

i) contacting a test compound with a DPP4 polypeptide, 

ii) detect binding of said test compound to said DPP4 polypeptide. 



2. A method of screening for therapeutic agents useful in the treatment of a disease 
comprised in a group of diseases consisting of cardiovascular disorders, 
15 dermatological disorders, cancer disorders, hematological disorders, respiratory 

diseases, gastrointestinal and liver diseases, urological disorders and metabolic 
diseases in a mammal comprising the steps of 

i) determining the activity of a DPP4 polypeptide at a certain concentration 
20 of a test compound or in the absence of said test compound, 

ii) determining the activity of said polypeptide at a different concentration 
of said test compound. 

25 3. A method of screening for therapeutic agents usefiil in the treatment of a disease 
comprised in a group of diseases consisting of cardiovascular disorders, 
dermatological disorders, cancer disorders, hematological disorders, respiratory 
diseases, gastrointestinal and liver diseases, urological disorders and metabolic 
diseases in a mammal comprising the steps of 



i) determining the activity of a DPP4 polypeptide at a certain concentration 
of a test compound, 
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ii) determining the activity of a DPP4 polypeptide at the presence of a 
compound known to be a regulator of a DPP4 polypeptide. 

5 4. The method of any of claims 1 to 3, wherein the step of contacting is in or at the 
surface of a cell. 

5. The method of any of claims 1 to 3, wherein the cell is in vitro. 

10 6. The method of any of claims 1 to 3, wherein the step of contacting is in a cell- 
free system. 

7. The method of any of claims 1 to 3, wherein the polypeptide is coupled to a 
detectable label. 

15 

8. The method of any of claims 1 to 3, wherein the compound is coupled to a 
detectable label. 

9. The method of any of claims 1 to 3, wherein the test compound displaces a 
20 ligand which is first bound to the polypeptide. 

10. The method of any of claims 1 to 3 , wherein the polypeptide is attached to a 
solid support. 

25 11. The method of any of claims 1 to 3, wherein the compound is attached to a solid 
support. 

12. A method of screening for therapeutic agents useful in the treatment of a disease 
comprised in a group of diseases consisting of cardiovascular disorders, 
30 dermatological disorders, cancer disorders, hematological disorders, respiratory 

diseases, gastrointestinal and liver diseases, urological disorders and metabolic 
diseases in a mammal comprising the steps of 
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i) contacting a test compound with a DPP4 polynucleotide, 

ii) detect binding of said test compound to said DPP4 polynucleotide, 

13. The method of claim 12 wherein the nucleic acid molecule is RNA. 

14. The method of claim 12 wherein the contacting step is in or at the surface of a 
cell. 

15. The method of claim 12 wherein the contacting step is in a cell-free system. 

16. The method of claim 12 wherein polynucleotide is coupled to a detectable label. 



15 17. The method of claim 12 wherein the test compound is coupled to a detectable 
label. 

18. A method of diagnosing a disease comprised in a group of diseases consisting of 
cardiovascular disorders, dermatological disorders, cancer disorders, 

20 hematological disorders, respiratory diseases, gastrointestinal and liver diseases, 

urological disorders and metabolic diseases in a mammal comprising the steps of 

i) determining the amount of a DPP4 polynucleotide in a sample taken 
from said mammal, 

25 

ii) determining the amount of DPP4 polynucleotide in healthy and/or 
diseased mammals. 

19. A pharmaceutical composition for the treatment of a disease comprised in a 
30 group of diseases consisting of cardiovascular disorders, dermatological 

disorders, cancer disorders, hematological disorders, respiratory diseases, 
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SEQUENCE LISTING 
<X10> Bayer AG, BHC 

<120> Diagnostics and Therapeutics for Diseases Associated with 

Dipeptidylpeptidase IV (DPP4 } 

<130> Le A 36 738 

<160> 5 

<170> Patentln version 3.1 

t 

<210> 1 

<211> 3407 

<212> DNA 

<213> Homo sapiens 

<400> 1 

cgcgcgtctc cgccgcccgc gtgacttctg cctgcgctcc ttctctgaac gctcacttcc 60 

gaggagacgc cgacgatgaa gacaccgtgg aagattcttc tgggactgct gggtgctgct 120 

gcgcttgtca ccatcatcac cgtgcccgtg gttctgctga acaaaggcac agatgatgct 180 

acagctgaca gtcgcaaaac ttacactcta actgattact taaaaaatac ttatagactg 240 

aagttatact ccttaagatg gatttcagat catgaatatc tctacaaaca agaaaataat 300 

atcttggtat tcaatgctga atatggaaac agctcagttt tcttggagaa cagtacattt 360 

gatgagtttg gacattctat caatgattat tcaatatctc ctgatgggca gtttattctc 420 

ttagaataca actacgtgaa gcaatggagg cattcctaca cagcttcata tgacatttat 480 

gatttaaata aaaggcagct gattacagaa gagaggattc caaacaacac acagtgggtc 540 

acatggtcac cagtgggtca taaattggca tatgtttgga acaatgacat ttatgttaaa 600 

attgaaccaa atttaccaag ttacagaatc acatggacgg ggaaagaaga tataatatat 660 

aatggaataa ctgactgggt ttatgaagag gaagtcttca gtgcctactc tgctctgtgg 720 

tggtctccaa acggcacttt tttagcatat gcccaattta acgacacaga agtcccactt 780 

attgaatact ccttctactc tgatgagtca ctgcagtacc caaagactgt acgggttcca 840 

tatccaaagg caggagctgt gaatccaact gtaaagttct ttgttgtaaa tacagactct 900 

ctcagctcag tcaccaatgc aacttccata caaatcactg ctcctgcttc tatgttgata 960 

ggggatcact acttgtgtga tgtgacatgg gcaacacaag aaagaatttc tttgcagtgg 1020 

ctcaggagga ttcagaacta ttcggtcatg gatatttgtg actatgatga atccagtgga 1080 

agatggaact gcttagtggc acggcaacac attgaaatga gtactactgg ctgggttgga 1140 

agatttaggc cttcagaacc tcattttacc cttgatggta atagcttcta caagatcatc 1200 

agcaatgaag aaggttacag acacatttgc tatttccaaa tagataaaaa agactgcaca 1260 

tttattacaa aaggcacctg ggaagtcatc gggatagaag ctctaaccag tgattatcta 1320 

tactacatta gtaatgaata taaaggaatg ccaggaggaa ggaatcttta taaaatccaa 1380 

cttattgact atacaaaagt gacatgcctc agttgtgagc tgaatccgga aaggtgtcag 1440 

tactattctg tgtcattcag taaagaggcg aagtattatc agctgagatg ttccggtcct 1500 

ggtctgcccc tctatactct acacagcagc gtgaatgata aagggctgag agtcctggaa 1560 

gacaattcag ctttggataa aatgctgcag aatgtccaga tgccctccaa aaaactggac 1620 

ttcattattt tgaatgaaac aaaattttgg tatcagatga tcttgcctcc tcattttgat 1680 

aaatccaaga aatatcctct actattagat gtgtatgcag gcccatgtag tcaaaaagca 1740 
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gacactgtct tcagactgaa ctgggccact taccttgcaa gcacagaaaa cattatagta 1800 

gctagctttg atggcagagg aagtggttac caaggagata agatcatgca tgcaatcaac 1860 

agaagactgg gaacatttga agttgaagat caaattgaag cagccagaca attttcaaaa 1920 

atgggatttg tggacaacaa acgaattgca atttggggct ggtcatatgg agggtacgta 1980 

acctcaatgg tcctgggatc gggaagtggc gtgttcaagt gtggaatagc cgtggcgcct 2040 

gtatcccggt gggagtacta tgactcagtg tacacagaac gttacatggg tctcccaact 2100 

ccagaagaca accttgacca ttacagaaat tcaacagtca tgagcagagc tgaaaatttt 2160 

aaacaagttg agtacctcct tattcatgga acagcagatg ataacgttca ctttcagcag 2220 

tcagctcaga tctccaaagc cctggtcgat gttggagtgg atttccaggc aatgtggtat 2280 

actgatgaag accatggaat agctagcagc acagcacacc aacatatata tacccacatg 2340 

agccacttca taaaacaatg tttctcttta ccttagcacc tcaaaatacc atgccattta 2400 

aagcttatta aaactcattt ttgttttcat tatctcaaaa ctgcactgtc aagatgatga 2460 

tgatctttaa aatacacact caaatcaaga aacttaaggt tacctttgtt cccaaafcttc 2520 

atacctatca tcttaagtag ggacttctgt cttcacaaca gattattacc ttacagaagt 2580 

ttgaattatc cggtcgggtt ttattgttta aaatcatttc tgcatcagct gctgaaacaa 2640 

caaataggaa ttgtttttat ggaggctttg catagattcc ctgagcagga ttttaatctt 2700 

tttctaactg gactggttca aatgttgttc tcttctttaa agggatggca agatgtgggc 2760 

agtgatgtca ctagggcagg gacaggataa gagggattag ggagagaaga tagcagggca 2620 

tggctgggaa cccaagtcca agcataccaa cacgagcagg ctactgtcag ctcccctcgg 2880 

agaagagctg ttcaccacga gactggcaca gttttctgag aaagactatt caaacagtct 2940 

caggaaatca aatatcgaaa gcactgactt ctaagtaaac cacagcagtt gaaagactcc 3000 

aaagaaatgt aagggaaact gccagcaacg cagcccccag gtgccagtta tggctatagg 3060 

tgctacaaaa acacagcaag ggtgatggga aagcattgta aatgtgcttt taaaaaaaaa 3120 

tactgatgtt cctagtgaaa gaggcagctt gaaactgaga tgtgaacaca tcagcttgcc 3180 

ctgttaaaag atgaaaatat ttgtatcaca aatcttaact tgaaggagtc cttgcatcaa 3240 

tttttcttat ttcatttctt tgagtgtctt aattaaaaga atattttaac ttccttggac 3300 

tcattttaaa aaatggaaca taaaatacaa tgttatgtat tattattccc attctacata 3360 

ctatggaatt tctcccagtc atttaataaa tgtgccttca ttttttc 3407 

<210> 2 

<211> 766 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Lys Thr Pro Trp Lys Val Leu Leu Gly Leu Leu Gly Ala Ala Ala 

15 10 15 

Leu Val Thr lie He Thr Val Pro Val Val Leu Leu Asn Lys Gly Thr 

20 25 30 

Asp Asp Ala Thr Ala Asp Ser Arg Lys Thr Tyr Thr Leu Thr Asp Tyr 

35 40 45 

Leu Lys Asn Thr Tyr Arg Leu Lys Leu Tyr Ser Leu Arg Trp He Ser 

50 55 60 

Asp His Glu Tyr Leu Tyr Lys Gin Glu Asn Asn He Leu Val Phe Asn 
65 70 75 80 

Ala Glu Tyr Gly Asn Ser Ser Val Phe Leu Glu Asn Ser Thr Phe Asp 
85 90 95 
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Glu Phe Gly His Ser He Asn Asp Tyr Ser He Ser Pro Asp Gly Gin 

100 105 110 

Phe He Leu Leu Glu Tyr Asn Tyr Val Lys Gin Trp Arg His Ser Tyr 

115 120 125 

Thr Ala Ser Tyr Asp He Tyr Asp Leu Asn Lys Arg Gin Leu He Thr 

130 135 140 

Glu Glu Arg lie Pro Asn Asn Thr Gin Trp Val Thr Trp Ser Pro Val 
145 150 155 160 

Gly His Lys Leu Ala Tyr Val Trp Asn ABn Asp He Tyr Val Lys lie 

165 170 175 

Glu Pro Asn Leu Pro Ser Tyr Arg He Thr Trp Thr Gly Lys Glu Asp 

180 185 190 

He He Tyr Asn Gly He Thr Asp Trp Val Tyr Glu Glu Glu Val Phe 

195 200 205 

Ser Ala Tyr Ser Ala Leu Trp Trp Ser Pro Asn Gly Thr Phe Leu Ala 

210 215 220 

Tyr Ala Gin Phe Asn Asp Thr Glu Val Pro Leu He Glu Tyr Ser Phe 
225 230 235 240 

Tyr Ser Asp Glu Ser Leu Gin Tyr Pro Lys Thr Val Arg Val Pro Tyr 

245 250 255 

Pro Lys Ala Gly Ala Val Asn Pro Thr Val Lys Phe Phe Val Val Asn 

260 265 270 

Thr Asp Ser Leu Ser Ser Val Thr Asn Ala Thr Ser He Gin lie Thr 

275 280 285 

Ala Pro Ala Ser Met Leu He Gly Asp His Tyr Leu Cys Asp Val Thr 

290 295 300 

Trp Ala Thr Gin Glu Arg He Ser Leu Gin Trp Leu Arg Arg He Gin 
305 310 315 320 

Asn Tyr Ser Val Met Asp He Cys Asp Tyr Asp Glu Ser Ser Gly Arg 

325 330 335 

Trp Asn Cys Leu Val Ala Arg Gin His He Glu Met Ser Thr Thr Gly 

340 345 350 

Trp Val Gly Arg Phe Arg Pro Ser Glu Pro His Phe Thr- Leu Asp Gly 

355 360 365 

Asn Ser Phe Tyr Lys He He Ser Asn Glu Glu Gly Tyr Arg His lie 

370 375 380 

Cys Tyr Phe Gin He Asp Lys Lys Asp Cys Thr Phe He Thr Lys Gly 
385 390 395 400 

Thr Trp Glu Val He Gly He Glu Ala Leu Thr Ser Asp Tyr Leu Tyr 

405 410 415 

Tyr lie Ser Asn Glu Tyr Lys Gly Met Pro Gly Gly Arg Asn Leu Tyr 

420 425 430 

Lys He Gin Leu Ser Asp Tyr Thr Lys Val Thr Cys Leu Ser Cys Glu 

435 440 44S 

Leu Asn Pro Glu Arg Cys Gin Tyr Tyr Ser Val Ser Phe Ser Lys Glu 

450 455 460 

Ala Lys Tyr Tyr Gin Leu Arg Cys Ser Gly Pro Gly Leu Pro Leu Tyr 
465 470 475 480 
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Thr Leu Hi 8 Ser Ser V al Asn Asp Lys Gly Leu Arg Val Leu Glu 



485 



Asp 



490 



495 



Asn Ser Ala Leu Asp Lys Met Leu Gin Asn Val Gin Met Pro Ser Lys 



500 



505 



510 



Lys Leu Asp Phe lie He Leu Asn Glu Thr Lys Phe Trp Tyr Gin Met 
515 520 S2S 

He Leu Pro Pro His Phe Asp Lys Ser Lye Lys Tyr Pro Leu Leu Leu 
530 jag M0 

Asp Val Tyr Ala Gly Pro Cys Ser Gin Lys Ala Asp Thr Val Phe Arg 
545 550 555 560 

Leu Asn Trp Ala Thr Tyr Leu Ala Ser Thr Glu Asn lie lie Val Ala 

565 570 S7 5 

Ser Phe Asp Gly Arg Gly Ser Gly Tyr Gin Gly Asp Lys lie Met His 

580 585 S90 

Ala He Asn Arg Arg Leu Gly Thr Phe Glu Val Glu Asp Gin He Glu 

S9S 600 
Ala Ala Arg Gin Phe Ser Lys Met Gly Phe Val Asp Asn Lys Arg lie 

610 «S «0 

Ala He Trp Gly Trp Ser Tyr Gly Gly Tyr Val Thr Ser Met Val Leu 
" 5 630 «5 640 

Gly Ser Gly Ser Gly Val Phe Lys Cys Gly He Ala Val Ala Pro Val 

645 650 6 55 

Ser Arg Trp Glu Tyr Tyr Asp Ser Val Tyr Thr Glu Arg Tyr Met Gly 

660 665 670 

Leu Pro Thr Pro Glu Asp Asn Leu Asp His Tyr Arg Asn Ser Thr Val 

675 680 68s 

Met Ser Arg Ala Glu Asn Phe Lys Gin Val Glu Tyr Leu Leu He His 

690 695 700 

Gly Thr Ala Asp Asp Asn Val His Phe Gin Gin Ser Ala Gin He Ser 

Lys Ala Leu Val Asp Val Gly Val Asp Phe Gin Ala Met Trp Tyr Thr 

725 ?30 ,35 

Asp Glu Asp His Gly He Ala Ser Ser Thr Ala His Gin His He Tyr 

740 '« 750 

Thr His Met Ser His Phe He Lys Gin Cys Phe Ser Leu Pro 
755 760 76s 
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<210> 



3 



<211> 



20 



<212> 



DNA 



<213> 



artificial sequence 



<220> 



<223> 



forward primer 



<400> 



3 



tgaatccagt ggaagatgga 



20 



<210> 4 

<211> 20 

<212> DNA 

<213> artificial sequence 
<220> 

<2 2 3 > reverse * primer 

<400> 4 

tcttccaacc cagccagtag 20 



<211> 24 

<212> DNA 

<213> artificial sequence 
<220> 

<223> probe 

<400> 5 

actgcttagt ggcacggcaa caca 24 



<210> 



5 
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